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ABSTRACT 



New cytochrome P-450 dependent monooxygenases and 
DNA molecules encoding these monooxygenases are 
provided, which are able to catalyze the biosynthetic path- 
way from amino acids to their corresponding cyanohydrins, 
the presursors of the cyanogenic glycosides, or to glucosi- 
nolates. Moreover, the invention provides methods for 
obtaining DNA molecules according to the invention and 
methods for obtaining transgenic plants resistant to insects, 
acarids, or nematodes or plants with improved nutritive 
value. 

37 Claims, No Drawings 
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CYTOCHROME P-450 MONOOXYGENASES 

This is a divisional of application Ser. No. 08/656,177, 
filed Aug. 8, 1996, now U.S. Pat. No. 5,882,851 which is a 
§ 371 of PCT/EP94/03938, filed Nov. 28, 1994 (published 5 
Jun. 15, 1995, as WO 95/16041), which claims priority of 
European Patent Application No. 93810860.2, filed Dec. 8, 
1993. 

The present invention relates to genetic engineering in 
plants using recombinant DNA technology in general and to lO 
enzymes involved in the biosynthesis of cyanogenic glyco- 
sides and glucosinolates and genes encoding these enzymes 
in particular. The proteins and genes according to the 
invention can be used to improve the nutritive value or pest 
resistance of plants, 15 

Cyanogenic glycosides constitute secondary plant 
metabolites in more than 2000 plant species. In some 
instances they are the source of HON which can render a 
plant toxic if it is taken as food. For example the tubers of 
the cyanogenic crop cassava (Manihot esculenta) constitute 20 
an important staple food in tropical areas. However, the 
cyanogenic glycosides present in the tubers may cause 
cyanide poisoning in humans due to insufiBciently processed 
cassava products. Other plant species whose enzymatic 
production of HON accounts for their potential toxicity if 25 
taken in excess as food or used as animal feed include white 
clover (Trifolium repens)^ sorghum (Sorghum hicolor), linen 
flax (Linum usitatissimum)^ triglochinin {Triglochin 
maritima), lima beans (Phaseolus lunatus), almonds 
(Amygdalus) and seeds of apricot (Prunus), cherries and 30 
apple (Malus). The toxic properties could be reduced by 
blocking the biosynthesis of cyanogenic glycosides in these 
plants. 

The primary precursors of the naturally occuring cyano- 
genic glycosides are restricted to the five hydrophobic 35 
protein amino acids valine, leucine, iso leucine, phenylala- 
nine and tyrosine and to a single non -protein amino acid, 
cyclopentenylglycine. These amino acids are converted in a 
series of reactions to cyanohydrins which are ultimately 
linked to a sugar residue. Amygdalin for example constitutes 40 
the O-p-gentiobioside and prunasin the O-p-glucoside of 
(R)-mandelonitrile. Another example of cyanogenic glyco- 
sides having aromatic aglycones is the epimeric pair of the 
cyanogenic glycosides dhurrin and taxiphyllin which are to 
be found in the genus Sorghum and Taxus, respectively. 45 
p-Hydroxymandelo-nitrile for example is converted into 
dhurrin by a UDPG-glycosyltransf erase. Similiar glycosyl- 
transferases are believed to be present in most plants. 
Vicianin and lucumin are further examples for disaccharide 
derivatives similiar to amygdalin. Sambunigrin contains 50 
(S)-mandelonitrile as its aglycone and is therefore epimeric 
to prunasin. 

Examples of cyanogenic glycosides having aliphatic 
aglycones are linamarin and lotaustralin found in clover, 
linen flax, cassava and beans. A detailed review on cyano- 55 
genie glycosides and their biosynthesis can be found in 
Conn, Naturwissenschaften 66:28—34, 1979, herein incor- 
porated by reference. 

The biosynthetic pathway for the cyanogenic glucoside 
dhurrin derived from tyrosine has been extensively studied 60 
(Halkier et al, 'Cyanogenic glucosides: the biosynthetic 
pathway and the enzyme system involved' in: 'Cyanide 
compounds in biology', Wiley Chichester (Ciba Foundation 
Symposium 140), pages 49-66, 1988; Halkier and Moller, 
Plant Physiol. 90:1552-1559, 1989; Hallder et al. The J. of 65 
Biol. Chem. 264:19487-19494, 1989; Halkier and Moller, 
Plant Physiol. 96:10-17, 1990, Halkier and MoUer, The J. of 



.417 

2 

Biol. Chem. 265:21114-21121, 1990; Halkier et al, Proc. 
Natl. Acad. Sci. USA 88:487-491, 1991; Sibbesen et al, in: 
'Biochemistry and Biophysics of cytochrome P-450. Struc- 
ture and Function, Biotechnological and Ecological 
Aspects', Archakov, A. I. (ed.), 1991, Koch et al, 8th Int. 
Conf. on Cytochrome P450, Abstract PI 1. 053; and Sibbesen 
et al, 8th Int. Conf. on Cytochrome P450, Abstract PII.016). 
It has been found that L-Tyrosine is converted to p-hydroxy- 
mandelonitrilc, the precursor of dhurrin with 
N-hydroxytyrosine and supposedly N,N-dihydroxytyrosine, 
2-nitroso-3-(p-hydroxyphcnyl)propionic acid, (E)- and (Z)- 
p-hydroxyphenylacetaldehydoxime, and 
p-hydroxyphenylacetonitrile as key intermediates. Two 
monooxygenases dependent on cytochrome P-450 have 
been reported to be involved in this pathway. A similiar 
pathway also involving cytochrome P-450 dependent 
monooxygenases has been demonstrated for the synthesis of 
linamarin and lotaustralin from valine and isoleucine respec- 
tively in cassava (Koch et al. Archives of Biochemistry and 
Biophysics, 292:141-150, 1992). 

It has now surprisingly been found that the complex 
pathway from L-tyrosine to p-hydroxy-mandelonitrile sum- 
marized above can be reconstituted by two enzymes only, 
which turn out to be identical to the cytochrome P-450 
dependent monooxygenases. This result is very surprising 
given the high degree of complexity of the pathway reflected 
by its numerous intermediates. Tlius the two cytochrome 
P-450 monooxygenases are multifunctional. A first enzyme, 
designated P-450y^, converts the parent amino acid to the 
oxime. A second enzyme, designated P-450y.^, converts the 
oxime to the cyanohydrin. Multifunctional cytochrome 
P-450 enzymes have not previously been found and 
described in plants. 

Glucosinolates are hydrophilic, non-volatile thioglyco- 
sides found within several orders of dicotyledoneous 
angiosperms (Cronquist, 'The Evolution and Classification 
of Flowering Plants, New York Botanical Garden, Bronx, 
1988). Of greatest economic significance is their presence in 
all members of the Brassicaceae (order of Capparales), 
whose many cultivars have for centuries provided mankind 
with a source of condiments, relishes, salad crops and 
vegetables as well as fodders and forage crops. More 
recently, rape (especially Brassica napus and Brassica 
campestris) has emerged as a major oil seed of commerce. 
About 100 different glucosinolates are known possessing the 
same general structure but differing in the nature of the side 
chain. Glucosinolates are formed from protein amino acids 
either directly or after a single or multiple chain extension 
(UnderhUl et al, Biochem, Soc. Symp. 38:303-326, 1973). 
N-hydroxy amino acids and aldoximes which have been 
identified as intermediates in the biosynthesis of cyanogenic 
glycosides also serve as eflScient precursors for the biosyn- 
thesis of glucosinolates (Kindl et al, Phytochemistry 
7:745-756, 1968; Matsuo et al, Phytochemistry 11:697-701, 
1972; Underbill, Eur. J. Biochem. 2:61-63, 1967). 

It has now surprisingly been found that the cytochrome 
P-450^ involved in cyanogenic glycoside synthesis is very 
similiar to the corresponding biosynthetic enzyme in glu- 
cosinolate synthesis. 

The reduction of the complex biosynthetic pathway for 
cyanohydrins described above to the catalytic activity of 
only two enzymes, cytochrome P-450;r and P-450y/, allows 
the introduction of the biosynthetic pathway of dhurrin into 
plants, which plants in their wildtype phenotype do not 
normally produce cyanogenic glycosides. By transfection of 
gene constructs coding for one or both of the two cyto- 
chrome P-450 monooxygenases it will be possible to either 
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reconstitute or newly establish a biosynthetic pathway for 
cyanogenic glycosides. It is therefore an object of the 
present invention to provide genes coding for cytochrome 
P-450 monooxygenases active in the biosynthesis of cya- 
nogenic glycosides. 5 

The introduction of a biosynthetic pathway for cyano- 
genic glycosides into plants by methods known in the art, 
which in their wildtype phenotype do not express these 
glycosides is of great interest. This is due to the surprising 
finding of the present invention that cyanogenic glycosides lo 
can be toxic to insects, acarids, and nematodes. Therefore, 
the introduction or reconstitution of a biosynthetic pathway 
for cyanogenic glycosides in plants or certain plant tissues 
will allow lo render plants toxic to insects, acarids or 
nematodes and thus help to reduce the damage to the crop by 15 
pests. In combination with other insecticidal principles such 
as Bacillus thuringiensis endotoxins the damage to the crop 
by pests could be even further reduced. 

Alternatively, the sequences of the genes encoding the 
monooxygenases according to the invention can be used to 20 
design DNA plasmids which upon transfection into a plant 
containing cyanogenic glycosides such as cassava, sorghum 
or barley eliminate cyanogenic glycosides normally pro- 
duced in wildtype plants. This can be achieved by expres- 
sion of antisense or sense RNAor of ribozymes as described 25 
in EP-458 367 Al, EP-240 208-A2, U.S. Pat. No. 5,231,020, 
WO 89/05852, and WO 90/11682 which RNA inhibits the 
expression of monooxygenases according to the invention. 
This is of great interest as in spite of numerous efforts it has 
not been possible through traditional plant breeding to 30 
completely remove the cyanogenic glycosides from for 
example cassava and sorghum. On the other hand it has been 
shown that elevated amounts of cyanogenic glycosides in 
the epidermal ceUs of barley cultivars confer increased 
sensitivity to attack by the mildew £uugas Erysiphe graminis 35 
(Pourmohensi, PhD thesis, Gottingen, 1989; Ibenthal et al, 
Angew. Bot. 67:97-106, 1993). A similiar effect has been 
observed in the cyanogenic rubber tree Hevea brasiliensis 
upon attack by the fungus Microcyclus ulei (Lieberei et al. 
Plant Phys. 90:3-36, 1989) and with flax attacked by Col- 40 
letotrichum lini (Liidtke et al, Biochem. Z. 324:433442, 
1953). In these instances the quantitative resistance of the 
plants stipulated above and of other plants, where cyano- 
genic glycosides confer increased sensitivity to attack by 
microorganisms, can be increased by preventing the produc- 45 
tion of cyanogenic glycosides in such plants. In barley, the 
cyanogenic glycosides are located in the epidermal cells. 
The antisene, sense or ribozyme constructs are therefore 
preferably but but not necessarily combined with an epider- 
mis specific promoter. 50 

The presence of even minor amounts of cyanogenic 
glycosides in plants may also cause nutritional problems due 
to generation of unwanted carcinogens as demonstrated in 
barley. Barley malt for example contains low amounts of the 
cyanogenic glucoside epiheterodendrin which in the cause 55 
of production of grain-based spirits can be converted to 
ethylcarbamate which is considered to be a carcinogen. 
Attempts are being made to introduce mandatory maximum 
allowable concentrations of ethylcarbamate in fermented 
food, beverages and spirits (Food Chemical News 29:33.35, 60 
1988). 

Plants containing cyanogenic glycosides typically con- 
tain only a single cyanogenic glycoside or just a few. In 
certain cases it is of interest to alter the cyanogenic glycoside 
profile of a plant. Since cytochrome P-450^^ shows broad 65 
substrate specificity this enzyme typically converts the 
aldoxime produced by cytochrome P-450y into the corre- 



sponding cyanohydrin. Alteration of the chemical identity of 
cyanogenic glycosides produced in a specific plant can thus 
be accomplished by transforming a plant with an additional 
gene encoding an expressible cytochrome P-45O7 monooxy- 
genase with a substrate specificity different from the natu- 
rally occuring enzyme. 

l^e present invention relates primarily to a DNA mol- 
ecule coding for a cytochrome P-450 monooxygenase, 
which catalyzes the conversion of an amino acid to the 
corresponding N-hydroxy amino acid and the oxime derived 
from this N-hydroxy amino acid Preferably the inventive 
monooxygenase catalyzes the conversion of an amino acid 
selected from the group consisting of tyrosine, 
phenylalanine, tryptophan, valine, leucine, isoleucine and 
cyclopentenylglycine or an amino acid selected from the 
group consisting of L-tyrosine, L- valine and L-iso leucine. 
Additionally the present invention relates to a DNA mol- 
ecule coding for a cytochrome P-450 monooxygenase, 
which monooxygenase catalyzes the conversion of said 
oxime to a nitrile and the conversion of said nitrile to the 
corresponding cyanohydrin. The DNA molecules according 
to the invention either correspond to naturally occuring 
genes or to functional homologues thereof which are the 
result of mutation, deletion, truncation, etc. but still encode 
cytochrome P-450 monooxygenases, which either catalyze 
the conversion of an amino acid to the corresponding 
N-hydroxy amino acid and the oxime derived from this 
N-hydroxy amino acid, or the conversion of said oxime to a 
nitrile and the subsequent conversion of said nitrile to the 
corresponding cyanohydrin. Both monooxygenases are able 
to catalyze more than one reaction of the biosynthetic 
pathway of cyanogenic glycosides but preferably contain a 
single catalytic center. The monooxygenase cytochrome 
P-450, converting the parent amino acid is also involved in 
glucx)sinolate biosynthesis. Because cytochrome P-450^ 
determines the substrate specificity and thus the type of 
glucosinolates produced and because cytochrome P-450y^ 
constitutes the rate limiting step, the principles already 
described above for cyanogenic glucosides can also be used 
to down- or up-regulate the synthesis of glucosinolates in 
glucosinolate producing plants and to alter the compositon 
of glucosinolates produced. 

The inventive DNA molecule encoding cytochrome 
P-45O7 is obtainable from plants which produce cyanogenic 
glycosides and glucosinolates. These plants include but are 
not limited to plants selected fi-om the group consisting of 
the species Sorghum, Trifoliun, Linum, Taxus, Triglochin, 
Mannihot, Amygdalus and Prunus as well as cruciferous 
plants. In a preferred embodiment of the invention the DNA 
molecule is obtained from Sorghum bicolor (L.) Moench or 
Manihot esculenta Crantz. The sequence similarity between 
cytochrome P-450^ monooxygenases from different plants 
producing cyanogenic glycosides or glucosinolates is evi- 
denced by the specific cross- re activity of antibodies pre- 
pared against cytochrome P-4507^ isolated from sorghum, 
with the corresponding cytochrome P-450 enzyme in cas- 
sava and with the corresponding enzyme in the glucosinolate 
producing plant Tropaeolum majus. Southern blotting using 
the cDNA clone encoding cytochrome ^-4^50-^ shows 
specific and strong hybridization to genomic DNA isolated 
fi-om cassava, Tropaeolum majus, and rape. Of all approxi- 
mately 250 known published sequences for cytochrome 
P-450 enzymes, cytochrome P-450jto shows the highest 
sequence similarity to the petunia 3'5'-flavonoid hydroxylase 
(30,8%) and 28% sequence similiarity to CYP1A2 from 
rabbit. The group of cytochrome P-450/ monooxygenases 
functionally characterized by catalyzing the conversion of 
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an amino acid to the corresponding aldoxime can thus be 
defined as cytochrome P-450 enzymes the amino acid 
sequence of which exhibits a 32% or higher sequence 
similarity and preferably a 40% or higher sequence similar- 
ity to that of cytochrome P-4507^ Cytochrome P-450 gene 
protein families are defined as having less than 40% amino 
acid identity to a cytochrome P-450 protein from any other 
family. Consequently, cytochrome P-450j^ belongs to a 
new P-450 protein family. 

The inventive DNA molecule encoding cytochrome 
P-450yy is obtainable from plants which produce cyanogenic 
glycosides. In a preferred embodiment of the invention the 
DNA molecule is obtained from Sorghum bicolor (L.) 
Moench or Manihot esculenta Crantz. The enzyme isolated 
from Sorghum bicolor (L.) Moench is designated cyto- 
chrome P-450^^, The catalytic properties of this enzyme 
resembles those of a cytochrome P-450 activity reported in 
microsomes from rat liver (DeMaster et al, J. Org. Chem. 
5074-5075, 1992) which has neither been isolated nor 
further characterized. A characteristic of cytochrome 
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MDLADIPKQQRLMAGNALVV (SEQ ID NO: 12). 

For other cytochrome P-450jj enzymes, the N-terminal 
sequences may be different. 

Optionally, a P-450 jj monooxygenase might also com- 
prise one of the following sequences: 
-ARLAEIFATII- (SEQ ID NO: 13) 
-EDFTVTTK- (SEQ ID NO: 14) 
-QYAALGSVFTVPII- (SEQ ID NO: 15) 
-XXPFPI- (SEQ ID NO: 16). 

Another embodiment of the present invention deals with 
a method for the preparation of cDNA coding for a cyto- 
chrome P-450 monooxygenase, which either catalyzes the 
conversion of an amino acid preferably selected from the 
group consisting of tyrosine, phenylalanine, tryptophan, 
valine, leucine, isoleucine and cyclopentenylglycine, to the 
corresponding N-hydroxy amino acid and the oxime derived 
from this N-hydroxy amino acid (cytochrome P-450/); or the 
conversion of said oxime to a nitrile and the conversion of 
said nitrile to the corresponding cyanohydrin (cytochrome 
P-450yy); comprising 



P-450o^ and of other members belonging to the cytochrome 20 (a) isolating and solubilizing microsomes from plant tissue 



family is that dehydration of the oxime to the 
corresponding nitrile is dependent on the presence of 
NADPH but that this dependence can be overcome by the 
addition of sodium dithipnite or other reductants. Cyto- 
chrome P-450 enzymes able to convert aldoximes into 
cyanohydrins might be present in most living organisms. 

For the purposes of gene manipulation using recombi- 
nant DNA technology the DNA molecule according to the 
invention may in addition to the gene coding for the 



producing cyanogenic glycosides or glucosinolates, 

(b) purifying the cytochrome P-450 monooxygenase, 

(c) raising antibodies against the purified monooxygenase, 

(d) probing a cDNA expression library of plant tissue 
25 producing cyanogenic glycosides or glycosinolates with 

said antibody, and 

(e) isolating clones which express the monooxygenase. 
Microsomes can be isolated from plant tissues which 

show a high activity of the enzyme system responsible for 
monooxygenase comprise DNA which allows for example 30 biosynthesis of the cyanogenic glycosides. The.se tissues 



replication and selection of the inventive DNA in microor- 
ganisms such as E. coli, Bacillus, Agrobacterium, Strepto- 
myces or yeast. It may also comprise DNA which allows the 
monooxygenase genes to be expressed and selected in 
homologous or heterologous plants. Such sequences com- 
prise but are not limited to genes whose codon usage has 
been adapted to the codon usage of the heterologous plant as 
described in WO 93/07278; to genes conferring resistance to 
neomycin, kanamycin, methotrexate, hygromycin, 
bleomycin, streptomycin, or gentamycin, to 
aminoethy Icy stein, glyophosphate, sulfonylurea, or phos- 
phinotricin; to scorable marker genes such as galactosidase; 
to its natural promoter and transcription termination signals; 
to promoter elements such as the 35S and 19S CaMV 



may be different from plant species to plant species. A 
preferred source of microsomes are freshly isolated shoots 
harvested 1 to 20 days, preferably 2 to 10 days and most 
preferably 2 to 4 days after germination. Etiolated seedlings 
35 are preferred from plant producing cyanogenic glycosides 
but light grown seedlings may also be used. Following 
isolation the microsomes are solubilized in buffer containing 
one or more detergents. Preferred detergents are RENEX 
690 (J. Lorentzen A/S, Kvistgard, Denmark), reduced Triton 
40 X-100 (RTX-100) and CHAPS. 

The cytochrome P-450 monooxygenases can be purified 
applying standard techniques for protein purification such as 
ultracentrifugation, fractionated precipitation, dialysis, 
SDS-PAGE and column chromatography. Possible columns 



promoters, or tissue specific plant promoters such as pro- 45 comprise but are not limited to ion exchange columns such 



meters specific for root (described for example in EP-452 
269-A2, WO 91/13992, U.S. Pat. No. 5,023,179), green 
leaves such as the maize phosphoenol pyruvate carboxylase 
(PEPC), pith or poUen (described for example in WO 
93/07278), or inducible plant promoters (EP 332 104); and 
to heterologoxLs transcription termination signals. 

The present invention also relates to monooxygenases 
which catalyze the conversion of an amino acid preferably 
selected from the group consisting of tyrosine, 
phenylalanine, tryptophan, valine, leucine, isoleucine and 
cyclopentenylglycine to the corresponding N-hydroxyamino 
acid and the oxime derived from this N-hydroxyamino acid 
(cytochrome P-450/); or the conversion of said oxime to a 
nitrile and the conversion of said nitrile to the corresponding 
cyanohydrine (cytochrome P-450//). In a preferred embodi- 
ment of the invention the monooxygenases are purified and 
can be used to establish monoclonal or polyclonal antibodies 
which specifically bind to the monooxygenases. 

In another preferred embodiment of the invention the 
cytochrome P-450// monooxygenase can be isolated from 
Sorghum, has a molecular weight of 5 1 kD as determined by 
SDS-PAGE and comprises the N-terminal sequence 
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60 



65 



as DEAE Seph arose. Reactive dye columns such as Ciba- 
cron yellow 3 agarose, Cibacron blue agarose and Reactive 
red 120 agarose, and gel filtration columns such as 
Sephacryl S-1000. The cytochrome P-450 content of the 
individual fractions can be determined from carbon monox- 
ide difference spectra. 

The purified proteins can be used to elicit antibodies in for 
example mice, goats, sheeps, rabbits or chickens upon 
injection. 5 to 50 ^g of protein are injected several times 
during approximately 14 day intervals. In a preferred 
embodiment of the invention 10 to 20 jUg are injected 2 to 6 
times in 14 day intervals. Injections can be done in the 
presence or absence of adjuvants. Immunoglobulins are 
purified from the antisera and spleens can be used for 
hybrid oma fusion as described in Harlow and Lane, 'Anti- 
bodies: A Laboratory Manual*, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y., 1988, herein incor- 
porated by reference. Antibodies specifically binding to a 
cytochrome P-450 monooxygenase can also be used in plant 
breeding to detect plants producing altered amounts of 
cytochrome P-450 monooxygenases and thus altered 
amounts of cyanogenic glycosides. 
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The methods for the preparation of plant tissue cDNA 
libraries are extensively described in Sambrook et al. 
Molecular cloning: A laboratory manual. Cold Spring Har- 
bor Laboratory Press, Cold Spring Harbor, N.Y., 1989, the 
essential parts of which regarding preparation of cDNA 5 
libraries are herein incorporated by reference. PolyA'*' RNA 
is isolated from plant tissue which shows a high activity of 
the enzyme system responsible for biosynthesis of the 
cyanogenic glycosides or glucosinolates. These tissues may 
be different from plant species to plant species. A preferred 
tissue for poly A* RNA isolation is the tissue of freshly 
isolated shoots harvested 1 to 20 days, preferably 2 to 10 
days and most preferably 2 to 4 days after germination. 
When cDNA libraries are made from glucosinolate produc- 
ing plants older or mature plant tissue may also be used. The 
obtained cDNA libraries can be probed with antibodies 
specifically binding the cytochrome P-450 monooxygenase 
and clones expressing the monooxygenase can be isolated. 

An alternative method for the preparation of cDNA cod- 
ing for a cytochrome P-450 monooxygenase comprises 

(a) isolating and solubilizing microsomes from plant tissue 20 
producing cyanogenic glycosides or glucosinolates, 

(b) purifying the cytochrome P-450 monooxygenase, 

(c) obtaining a complete or partial protein sequence of the 
monoxygenase, 

(d) designing oligonucleotides specifying DNA coding for 4 25 
to 15 amino acids of said monooxygenase protein 
sequence 

(e) probing a cDNA library of plant tissue producing cya- 
nogenic glycosides or glucosinolates with said 
oligonucleotides, or DNA molecules obtained from PCR 30 
amplification of cDNA using said oligonucleotides, and 

(f) isolating clones which encode cytochrome P-450 
monooxygenase. 

Amino acid sequences of internal peptides which are the 



Alternatively, total DNA from the DNA library, preferably 
from the cDNA library, can be prepared and used as a 
template for a PCR reaction with primers representing low 
degeneracy portions of the amino acid sequence. Preferably, 
the primers used will generate PCR products that represent 
a significant portion of the nucleotide sequence. The PCR 
products can be further probed to determine if they corre- 
spond to a portion of the P-450 monooxygenase gene using 
a synthetic oligonucleotide probe corresponding to an amino 
acid fragment sequence located in the interior or middle 
region of the P-450 monooxygenase protein. 

The cDNA clones and PCR products prepared as 
described above or fragments thereof may be used as a 
hybridization probe in a process of identifying further DNA 
sequences from a homologous or a heterologous source 
organism encoding a protein product that exhibits P-450 
monooxygenase activity such as, for example, a fungi or a 
heterologous plant. A suitable source would be tissue from 
plants containing cyanogenic glycosides or glucosinolates. 

They may also be used as an RFLP marker to determine, 
for example, the location of the cytochrome P-450 
monooxygenase gene or a closely linked trait in the plant 
genome or for marker assisted breeding [EP-A 306,139; WO 
89/07647]. 

Using the methods described above it is thus possible to 
isolate a gene that codes for a P-450 monooxygenase. 

Genes encoding cytochrome P-450 monooxygenase can 
be used in a method for producing a purified recombinant 
cytochrome P-450 monooxygenase which monooxygenase 
either catalyzes the conversion of an amino acid preferably 
selected from the group consisting of tyrosine, 
phenylalanine, tryptophan, valine, leucine, isoleucine and 
eye lopentenylgly cine to the corresponding N- hydroxy amino 
acid and the oxime derived from this N-hydroxyamino acid; 



result of protease digestion can be obtained by standard 35 or the conversion of said oxime to a nitrile and the conver- 



techniques such as Edman degradation. Oligonucleotides 
specifying DNA coding for partial protein sequences of the 
inventive monooxygenases are obtained by reverse transla- 
tion of parts of the protein sequence according to the genetic 
code. Protein sequences encoded by DNA sequences of low 40 
degeneracy are preferred for reverse translation. Their length 
ranges from 4 to 15 and preferably from 5 to 10 amino acids. 
If necessary the codons used in the oligonucleotides can be 
adapted to the codon usage of the plant source (Murray et al. 
Nucleic Acids Research 17:477-498, 1989), The obtained 45 
oligonucleotides can be used to probe cDNA libraries as 
described in Sambrook et al, Molecular cloning: A labora- 
tory manual. Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y., 1989, for clones which are able to 
basepair with said oligonucleotides. Alternatively, oligo- 50 
nucleotides can be used in a polymerase chain reaction, the 
methodology of which is known in the art, with plant cDNA 
as the template for amplification. In this case the obtained 
amplification products are used to probe the cDN A libraries. 
Clones encoding cytochrome P-450 monooxygenases are 55 
isolated. 

An alternative method of cloning genes is based on the 
construction of a gene library composed of expression 
vectors. In that method, analogously to the methods already 

described above, genomic DNA, but preferably cDNA, is 60 boUworm and tobacco budworm. The transgenic plants 



sion of said nitrile to the corresponding cyanohydrine; 
comprising 

(a) engineering the gene encoding said monooxygenase to 
be expressible in a host organism such as bacteria, yeast 
or insect cells, 

(b) transforming said host organism with the engineered 
gene, and 

(c) isolating the protein from the host organism or the culture 
supernatant. 

In a preferred embodiment of the invention the method is 
used to obtain purified recombinant cytochrome P-450j-j^, 
P-450q^ or cytochrome P-4507.j^ which has been modified 
by known techniques of gene technology. Preferably the 
modifications lead to increased expression of the recombi- 
nant protein or to altered substrate specificity. 

The inventive DNA molecules can be used to obtain 
transgenic plants resistant to insects or acarids examples of 
which are listed but not limited to those in Table B as well 
as nematodes. Preferably the transgenic plants are resistant 
to Coleoptera and Lepidoptera such as western com root 
worm (Diabrotica virgifera virgifera)^ northern com root 
worm (Diabrotica longicomis barberi), southern com root- 
worm (Diabrotica undecimpunctata howardi)^ cotton 
boUworm, European corn borer, com root webworm, pink 



first isolated from a cell or a tissue capable of expressing a 
desired gene product — in the present case a P-450 
monooxygenase — and is then spliced into a suitable expres- 
sion vector. The gene libraries so produced can then be 
screened using suitable measures, preferably using 
antibodies, and those clones selected which comprise the 
desired gene or at least part of that gene as an insert. 



65 



comprise DNA coding for monooxygenases which catalyze 
the conversion of an amino acid to the corresponding 
N-hydroxyamino acid and the oxime derived from this 
N-hydroxyamino acid; or the conversion of said oxime to a 
nitrile and the conversion of said nitrile to the corresponding 
cyanohydrine. In addition the transgenic plants may com- 
prise monooxygenase genes genetically linked to herbicide 
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resistance genes. The transgenic plants are preferably mono- 
cotyledoneous or dicotyledoneous plants examples of which 
ar listed in Table A. Preferably they are selected from the 
group consisting of maize, rice, wheat, barley, sorghum, 
cotton, soybeans, sunflower, grasses and oil seed rape. The 
plants can be obtained by a method comprising 

(a) introducing into a plant cell or plant tissue which can be 
regenerated to a complete plant, DNA comprising a gene 
expressible in that plant encoding an inventive 
monooxygenase, 

(b) selecting transgenic plants, and 

(c) identifying plants which are resistant to insects, acarids, 
or nematodes. 

The inventive DNA molecules can also be used to obtain 
transgenic plants expressing anti-sense or sense RNA or 
ribozymes targeted to the genes of the endogenous P-450 
monooxygenases the expression of which reduces the 
expression of cytochrome P-450 monooxygenases. Such 
plants show improved disease resistance or nutritive value 
due to reduced expression of cyanogenic glycosides or 
glucosinolates. The plants can be obtained with a method 
comprising 

(a) introducing into a plant cell or tissue which can be 
regenerated to a complete plant, DNA encoding sense 



10 



Especially preferred for the selection of the transform ants is 
an LS medium that does not contain auxin but contains 
cytokinin instead, and to which a selective substance has 
been added. ITie cuUures are kept in the light and are 
transferred to fresh medium at suitable intervals, but pref- 
erably at intervals of one week. Developing green shoots are 
cut out and cultured further in a medium that induces the 
shoots to form roots. Especially preferred within the scope 
of this invention is an LS medium that does not contain 
auxin or cytokinin but to which a selective substance has 
been added for the selection of the transform ants. 

In addition to Agrobacterium-mediated transformation, 
within the scope of this invention it is possible to use direct 
transformation methods for the insertion of the gene con- 
structions according to the invention into plant material. 

For example, the genetic material contained in a vector 
can be inserted directly into a plant cell, for example using 
purely physical procedures, for example by microinjection 
using finely drawn micropipettes (Neuhaus et al. Theoretical 
and Applied Genetics 74:363-373, 1987), clcctroporation 
(D'Halluin et al. The Plant Cell 4:1495-1505, 1992; WO 
92/09696), or preferably by bombarding the cells with 



RNA, anti sense RNA or a ribozyme, the expression of 25 microprojecUles that are coated with the transforming DNA 



which reduces the expression of cytochrome P-450 
monooxygenases according to claims 1 or 8, 

(b) selecting transgenic plants, and 

(c) identifying plants with improved disease resistance or 
nutritive value. 3° 
A number of very efficient processes are available for 

introducing DNA into plant cells, which processes are based 
on the use of gene transfer vectors or on direct gene transfer 
processes. 

One possible method of inserting a gene construct into a 35 
cell makes use of the infection of the plant cell with 
Agrobacterium tumefaciens and/or Agrobacterium 
rhizogenes, which has been transformed with the said gene 
construction. The transgenic plant cells are then cultured 
under suitable culture conditions known to the person skilled 40 
in the art, so that they form shoots and roots and whole 
plants are finally formed. 

Within the scope of this invention is the so-called leaf disk 
transformation using Agrobacterium (Horsch et al. Science 



("Microprojectile Bombardment"; Wang et al. Plant 
Molecular Biology 11:433-439, 1988; Gordon-Kamm et al, 
The Plant Cell 2:603-618, 1990; McCabe et al. Bio/ 
Technology 11:596-598, 1993; Christou et. Plant Physiol. 
87:671-674, 1988; Koziel et al. Biotechnology 11: 194-200, 
1993). Moreover, the plant material to be transformed can 
optionally be pretreated with an osmotic ally active sub- 
stance such as sucrose, sorbitol, polyethylene glycol, glu- 
cose or mannitol. 

Other possible methods for the direct transfer of genetic 
material into a plant ceU comprise the treatment of proto- 
plasts using procedures that modify the plasma membrane, 
for example polyethylene glycol treatment, heat shock treat- 
ment or electroporation, or a combination of those proce- 
dures (ShiUito et al. Biotechnology 3:1099-1103, 1985). 

A further method for the direct introduction of genetic 
material into plant cells, which is based on purely chemical 
procedures and which enables the transformation to be 



227- 1229-1231 1985) can also be used. Sterile leaf disks 45 carried out very eflSciently and rapidly, is described in 
^ Negrutiu et al. Plant Molecular Biology 8:363-373, 1987. 

Also suitable for the transformation of plant material is 
direct gene transfer using co-transformation (Schocher et al, 
BioATechnology 4:1093-1096, 1986). 

The list of possible transformation methods given above 
by way of example does not claim to be complete and is not 
intended to limit the subject of the invention in any way. 

In another embodiment of the invention target plants are 
exposed to a pesticidally effective amount of a cyanogenic 
glycoside to control insects, acarids, or nematodes attacking 
a monocotyledonous or dicotyledonous plant selected from 
the group of plant types consisting of Cereals, Protein Crops, 
Fruit Crops, Vegetables and Tubers, Nuts, Oil Crops, Sugar 
Crops, Forage and Turf Grasses, Forage Legumes, Fiber 
Plants and Woody Plants, Drug Crops and Spices and 
Flavorings. 

The following examples farther describe the materials and 
methods used in carrying out the invention and the subsc- 



from a suitable target plant are incubated with Agrobacte 
rium cells comprising one of the chimaeric gene construc- 
tions according to the invention, and are then transferred into 
or onto a suitable nutrient medium. Especially suitable, and 
therefore preferred within the scope of this invention, are LS 50 
media that have been soHdified by the addition of agar and 
enriched with one or more of the plant growth regulators 
customarily used, especially those selected from the group 
of the auxins consisting of a-naphthylacetic acid, picloram, 
2,4,5 -trichlorop henoxy acetic acid, 2,4- 
dichlorophenoxyacetic acid, indole-3-butyric acid, indole- 
3-lactic acid, indole -3-succinic acid, indole -3- acetic acid 
and p-chlorophenoxyacetic acid, and from the group of the 
cytokinins consisting of kinetin, 6-benzyladenine, 
2-isopentenyl adenine and zeatin. The preferred concentra- 
tion of auxins and cytokinins is in the range of 0.1 mg/1 to 
10 mg/1. 

After incubation for several days, but preferably after 
incubation for 2 to 3 days at a temperature of 20° C. to 40"" 



55 



C preferably from 23° C. to 35° C. and more preferably at 65 quent results. They are offered by way of illustration, and 
25° C. and in difihise light, the leaf disks are transferred to their recitation should not be considered as a lunitation ot 
a suitable medium for the purpose of shoot induction. the claimed invention. 
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EXAMPLES 

Example 1 
Preparation of microsomes 

Seeds of Sorghum bicolor (L.) Moench (hybrid S-1000) 5 
are obtained from Seedtec International Inc. (Hereford, Tex.) 
and germinated in the dark for 2 days at 28° C. on metal 
screens covered with gauze. Transfer of the seeds to germi- 
nation trays is carried out under dim green light. 
Microsomes are prepared from approximately 3 cm tall 
etiolated seedlings. TTie seedlings are harvested and homog- 
enized using a mortar and pestle in 2 volumes (v/w) of 250 
mM sucrose, 100 mM tricine (pH 7,9), 50 mM NaCl, 2 mM 
EDTA and 2 mM DTT. Polyvinylpolypyrrolidone is added 
(0.1 g/g fresh weight) prior to homogenization. The homo- 
gen ate is filtered through 22 /^m nylon cloth and centrifuged 
20 minutes at 48000 g. The supernatant is centrifuged for 1 
hour at 165000 g. The microsomal pellet is resuspended and 
homogenized in isolation buffer using a Potter-Elvehjem 
homogenizer fitted with a teflon pestle. After recentrifuga- 
tion and rehpmogenization, the homogenate is dialyzed 
overnight against 50 mM Tricine (pH 7,9), 2mM DTT under 
a nitrogen atmosphere. 

Example 2 

Enzyme assays: Determination of total cytochrome P-450 
Quantitative determination of total cytochrome P-450 is 
carried out by difference spectroscopy using an extinction 
difference coefficient of 91 mM"^cm"^ for the complex 
between reduced cytochrome P-450 and carbon monoxide 
(A450.490) (Omura et al, J. Biol. Chem. 239:2370-2378, ^" 
1964). Cytochrome P-450 substrate binding spectra are 
recorded with stepwise increased substrate concentration 
until saturating conditions are reached. 

Example 3 35 
Purification of cytochrome P-450 j^j^ and P-450c>;*r 



Buffer A: Buffer B: Buffer C: 4Q 



8.6% glycerol 


8.6% glycerol 


8.6% glycerol 


10 mM KH2POV 


40 mM KH2POV 


40 mM KH2PO4/ 


K2HPO4 


K2HPO4 


K2HPO4 


(pH 7.9) 


(pH 7.9) 


(pH 7.9) 


0.20 mM EDTA 


5.0 mM EDTA 


5.0 mM EDTA 


2.0 mM DTT 


2.0 mM DTT 


2.0 mM DTT 


1.0% RENEX 690 


1.0% RENEX 690 


1 .0% CHAPS 


0.05% RTX-lOO 


0.05% RTX-100 


0.05% RTX-100 




0.2% CHAPS 





45 



Buffers are degassed three times by stirring in vacuo 50 
before detergent and DTT are added. Between each 
degassing, the buffer is flushed with argon. 

Microsomes (400 mg protein in 20 ml) are diluted to 100 
ml with a bufifer composed of 8.6% glycerol, 10 mM 
KH2PO4/K2HPO4 (pH 7.9). The microsomes are solubilized 55 
by slow addition of 100 ml of the same buffer containing 2% 
RENEX 690 and 0.2% RTX-100 and constant stirring for 30 
minutes. Solubilized cytochrome P-450 is obtained as the 
supernatant after centrifugation for 30 minutes at 200 000 g 
in a Beckman 70: 1i rotor. The supernatant (190 ml) is 60 
applied (flow rate 100 ml/h) to a column (5x5 cm) of DEAE 
Sepharose fast flow/S-100 Sepharose (20:80 wet volumes) 
equilibrated in bufifer A. The ion exchange resin DEAE- 
Sepharose is diluted with the gel filtration material 
Sephacryl S-100 in the ratio 1:4 to avoid too high concen- 65 
trations of the cytochrome P-450 enzymes upon binding, 
which sometimes results in irreversible aggregation. The 
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column is washed with 150 ml buffer A after which the total 
amount of cytochromes P-450 including cytochrome 
P-450y^ and cytochrome P-450^^ is eluted with buffer B in 
a total volume of 150 ml. During this procedure, NADPH- 
cytochrome P-450-oxidoreductase and Cytochrome 
remain bound to the column and may subsequently be eluted 
and separated with bufifer B and a gradient of 0-300 mM 
KCl. 

The cytochrome P-450 eluate is adjusted to 1.0% CIIAPS, 
stirred for 30 minutes and then directly applied to a 25 ml 
(2.6x5 cm) column of Reactive yellow 3 sepharose equiU- 
brated in buffer C+1,0% RENEX 690. The flow rate used is 
25 ml/h. ITie column is washed with buffer C until the 
absorbance A^so shows that RENEX 690 is washed out. 
Cytochrome P-450jyj^ does not bind to this column, and is 
obtained in the run-off and wash. Subsequently the column 
is eluted with 400 mM KCl in buffer C. The cytochrome 
P-450^^ containing fractions are combined yielding 
approximately 60 ml and diluted with 5 volumes of buffer C 
to lower the KCl strength and permit rebinding of cyto- 
chrome P-450cfx on a second Reactive yellow 3 column. 
This column is eluted with a KCl gradient (0-500 mM) in a 
total volume of 100 ml in buffer C. This serves to elute 
cytochrome P-450o,^. 

The cytochrome P-450o:c Pool from the yellow agarose is 
diluted 5 times with buffer C to 20-25 mM KCl and apphed 
to a Cibachron blue agarose column (0.9x6 cm) equilibrated 
in buffer C. The flow rate used is 8 ml/h. The column is 
washed with 20 ml buffer C at the same flow rate. Cyto- 
chrome P-450o^ is eluted with a gradient of KCl, 0-2.0 M 
in buffer C in a total volume of 30 ml. 

The runoff from the first yellow 3 agarose column is 
applied (flow rate 40 ml^) to a column (2.8x8 cm) of 
Cibachron Blue Agarose equihbrated in buffer C. ITie col- 
umn is subsequently washed with buffer C and the cyto- 
chrome P-450y^ is eluted with a 0-500 mM linear KCl- 
gradient (2x100 ml) in buffer C. The combined cytochrome 
P-450 fractions are diluted 5 times with buffer C and applied 
(flow rate 7 ml/h) to a column (0.9x5 cm) of Reactive red 
120 agarose equilibrated in buffer C. The column is washed 
with 25 ml buffer C and cytochrome P-450jy^ is eluted with 
a 0-1.0 M KCl hnear gradient (2x30 ml) in b\iffer C. 
Optionally the eluate is gelfiUrated through a Sephadex G-50 
column, equilibrated in a buffer composed of 50 mM potas- 
sium phosphate (pH 7.9)/400 mM KCl/0.1% CHAPS/2 mM 
DTT. The eluted cytochrome P-4503t^ is dialyzed for 2 
hours against 50 mM potassium phosphate (pH 7,9)/2 mM 
DTT, diluted 4 fold with dialysis buffer in an Amicon 
ultrafiltration cell fitted with a YM-30 membrane and con- 
centrated to 1.45 nmols/ml. 

All procedtires are carried out at 4® C. The total cyto- 
chrome P-450 content of the individual fractions is deter- 
mined from the carbon monoxide difference spectrum. The 
absorption spectrum of the oxidized cytochrome P-450 is 
also recorded. The presence of a specific cytochrome P-450 
is monitored by substrate binding spectra. 

Example 4 

Antibody preparation 

Polyclonal antibodies are elicited in rabbits by six 
repeated subcutaneous injections (approx. 15 //g protein per 
rabbit per injection) at 15 day intervals of cytochrome 
P-450 j-yji or P-450o;c isolated by dye column chromatogra- 
phy or denatured enzyme purified by preparative SDS- 
PAGE. Freunds complete adjuvant is included in the first 
injection whereas Freunds incomplete adjuvant is used in 
subsequent injections. The immunoglobulin fractions of the 
antisera are purified by ammonium suffate precipitation 
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(Harboe et al, *A Manual of Quantitative Immunoelectro- 
phoresis: Methods and Applications', Universitetsforlaget, 
Oslo, 1973). The antiobides are monospecific as demon- 
strated by Western blotting. 

Example 5 
Characterization of P'450jYji 
5.1. Substrate binding spectra of P-450y^ 

Cytochrome P-4507^ in the oxidized state has a strong 
absorption peak at 420 nm, representing the low spin state 
of the iron of the heme group. The binding of a ligand to the 
heme group shifts the absorption maximum by changing the 
spin state of the iron. Binding of tyrosine at the catalytic site 
of cytochrome P-450y^ induces a change of the spin state 
of the oxidized iron from low to high spin, and thereby 
changes the absorption maximum from 420 nm to 390 nm 
producing a type I spectrum (Jefcoate, Methods in Enzy- 
mology 52:258-279, 1978). The following experimental 
procedure is used to obtain the substrate binding spectrum: 
two identical cuvettes containing a buffered solution of the 
isolated cytochrome P-450 are prepared. The substrate of the 
enzyme is added to the sample cuvette whereas the same 
volume of buffer is added to the other cuvette. The difference 
spectrum is then recorded in an SLM-Aminco DW2c spec- 
trophotometer. The absorption difference, A39o-42o> is pro- 
portional to the concentration of cytochrome P-450j^ with 
a bound substrate at its active site. If a saturating concen- 
tration of substrate is added to the sample cuvette, the 
absorption difference is proportional to the concentration of 
the substrate specific cytochrome P-450 in the cuvettes. The 
saturating concentration of the substrate is determined by 
titrating the cytochrome P-450 sample with increasing 
amounts of substrate and monitoring A39q.42o- 

If a cytochrome P-450 sample can be saturated with two 
different substrates, there may be two different cytochrome 
P-450 enzymes in the sample, or there may be one cyto- 
chrome P-450 enzyme able to bind to both substrates. To 
discriminate between these possibilities, saturating amounts 
of the two substrates are added sequentially and the A3P0.420 
absorption change is monitored. If, independent of the order 
of addition, the addition of the second sample gives rise to 
an increased A39o_42o value compared to the value after the 
addition of the first substrate, the two substrates are bound 
by different enzymes. If A390-420 remains unchanged upon 
addition of the second substrate, independent of the order of 
addition, both substrates bind to the same active site, i.e. to 
the same cytochrome P-450 enzyme. The data shown in 
Tables C and D below represent results of a typical experi- 
ment. 

TABLE C 

To 500 of isolated cytochrome P-450.j,yr dissolved in 50 mM 
Tricine pH 7,9 tyrosine is added until saturation concentration is 
reached followed by addition of N-hydroxytyrosine: 



initial 



resulting 



Added substrate 


A3P0-420 


dilution factor 


^390-420 


30 //I 5 mM tyrosine 


0,0437 


530/500 


0,0463 


60 jitl 5 mM tyrosine 


0,0496 


560/500 


0,0556 


90 /il 5 mM tyrosine 


0,04S6 


590/500 


0,0573 


+100 /il 20 mM 


0,0409 


690/500 


0,0564 


N-hydroxytyrosine 
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60 
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TABLE D 



Addtion of N-hydroxytyrosine until saturation concentration is 
reached, followed by addition of tyrosine 



10 



15 



Added substrate 


initial 

'^3S>0-420 


dilution factor 


resulting 

■^^390-420 


50 /A 20 mM 


0,0689 


550/500 


0,0758 


N-hydroxytyrosine 








120 ^1 5 mM 


0,0919 


620/500 


0,1140 


N-hyd roxy tyrosine 








140 ji\ 5 mM 


0,0911 


640/500 


0,1166 


N-hydroxytyrosine 








+90 /d 5 mM tyrosine 


0,0726 


730/500 


0,1060 



Both tyrosine and N-hydroxytyrosine produce a type I 
binding spectrum. The data show, that tyrosine and 
N-hydroxytyrosine bind to the same active site, that is the 
same cytochrome P-450, thus demonstrating that cyto- 
chrome P-4507^ is multffunctional. From the amounts of 
cytochrome P-450j^ used the absorption coeflScient (£390. 
42o) is calculated to be 67 cm~'^mM~^. A complete transition 
from a low spin state to a high spin state would have resulted 
in an absorption coefficient of 138 cm"^mM"^. 

5.2. Molecular weight and Amino acid sequence data 
The molecular weight of P-450 as determined by 

SDS-PAGE is 57 kD. 

Amino acid sequences are obtained by automated Edman 
degradation. The internal polypeptides are obtained by 
trypsin digestion of the purified protein and subsequent 
separation of the peptides using reverse phase HPLC. 
N-terminal sequence: 

-MATMEVEAAAATVLAAP" (SEQ ID NO: 3) 
Internal sequences: 
-VWDEPLR- (SEQ ID NO: 4) 
-YVYNLATK- (SEQ ID NO: 5) 
-SDTFMATPLVSSAEPR-- (SEQ ID NO: 6) 
-AQSQDITFAAVDNPSNAVEXALAEMVNNPEVMAK- 

(SEQ ID NO: 7) 
-AQGNPLLTIEEVK- (SEQ ID NO: 8) 
-LVQESDIPK- (SEQ ID NO: 9) 
-ISFSTG" (SEQ ID NO: 10) 
-LPAHLYPSISLH-- (SEQ ID NO: 11) 

5.3. Reconstitution of cytochrome P-4507^ activity: 
Reconstitution of the enzyme activity of a microsomal 

P-450 enzyme is accomplished by insertion of the cyto- 
chrome P-450 enzyme and the corresponding NADPH- 
cytochrome P-450 oxidoreductase into appropriate lipid 
miceUes made from different commercially avaOable lipids. 
Isolation of NAD PH -cytochrome P-450 oxidoreductase is 
done according to Hallder and Moller, Plant Physiol. 
96:10-17, 1990. A mixture of lipids can be used but with 
cytochrome P-4507^ di-lauroyl-phosphatidyl choline 
(DLPC) provides the best enzymatic activity. One rate 
limiting factor of this rate limiting reaction is the number of 
correctly formed complexes of cytochrome P-4507^ and 
NADPH-cytochrome P-450 oxidoreductase. Excess 
amounts of the oxidoreductase and concentrated enzyme 
solutions ensure a sufficient number of active complexes, 

A reconstituted enzyme is obtained using the following 
components: 



Cytochrome P-450-i-yR: 

Oxidoreductase, purified 
from Sorghum bicolor: 



100 /€g/ml in 50 mM potassium phosphate 
bufifer pH 7,9 

100 //g/ml in 50 mM potassium phosphate 
bufifer pH 7,9 
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-continued 



Lipid: 


10 mg/ml di-lauroyi -phosphatidyl choline, 




sonicated in 50 mM potassium phosphate 




bufifer pH 7,9 


NADPH: 


25 mg/ml H2O 


^'*C- tyrosine: 


commercially available from Amersham; 




[U — ^'•C] — L-tyrosine 0.5 //Ci 



5 



10 jUg of the lipid suspension is mixed in a glass vial with 10 
50 fil of the cytochrome P-4507-i^ (0—1.5 pmol) solution. 50 
//I of the oxidoreductase (0—0.15 U) solution is added and 
then 10 /il tyrosine solution and 10 ^1 NADPH solution are 
added and the mixture is sonicated in a Branson 5200 
sonication bath for one minute, llie reaction mixture is 15 
subsequently incubated for 1 hour at 30° C. At the end of the 
incubation period the reaction is stopped by transferring the 
glass vials onto ice. Radioactively labelled intermediates 
formed are extracted into 50 ehtyl acetate, applied to a 
siHca coated TLC plate and developed using an ethyl 20 
acetate/toluene (1:5 v/v) mixture as mobile phase. The 
resultant product, p-hydroxyphenylacet aldehyde oxime is 
visualized by autoradiography of the TLC plate. 
Alternatively, the intermediates are analyzed by reverse - 
phase HPLC coupled to a Berthold radioactiviy monitor. The 25 
HPLC separation was carried out using a nucleosil 100- 
lOC^g column isocratically cluted with 1.5% 2-propanol in 
25 mM Hepes pH 7.9 (Halkier et al, J. Biol. Chem. 
264:19487-19494, 1989). Control samples may be made by 
omitting either cytochrome P-450 oxidoreductase or 30 
NADPH. 

When reconstituted into micelles cytochrome P-4507^ 
catalyzes the conversion of L-tyrosine all the way to 
p-hydroxyphenyl-acet aldehyde oxime. The and turn- 
over number of the enzyme are 0.14 mM and 198 mind, 35 
respectively, when assayed in the presence of 15 mM NaCl, 
whereas the values are 0.21 mM and 228 min~^ when 
assayed in the absence of added sah. 

The formation of p-hydroxyphenyl-acetaldehyde oxime 
demonstrates that cytochrome P-450j-yji is a multi- 40 
functional heme-thiolate protein catalyzing reactions in 
addition to the initial N-hydroxylation of L-tyrosine. The 
E/Z ration of the parahydroxyphenyl-acet aldehyde oxime 
produced by the reconstituted cytochrome P-450j^ and 
determined by HPLC chromatography is 69/31. Using the 45 
TLC/autoradiography system, minute amounts of radiola- 
belled products comigrating with authentic 
p-hydroxybenzaldehyde and l-nitro-2(p-hydroxyphenyl) 
ethane are detected in the reaction mixtures. 

5.4. Inhibitory effect of antibodies against cytochrome 50 
P-450^^ 

The experiments are carried out using monospecific anti- 
bodies against P-4507-y^ as described in section 6.4. which 
uses antibodies against cytochrome P-450^^. The results are 
similiar to those obtained with the antibody against cyto- 55 
chrome P-450^^ except that the cytochrome P-4507^ anti- 
body exerts a stronger inhibitory effect (up to 60%) on 
cyanide production. 

5.5. cDNA libraries and colony screening 

Poly A^ RNA is isolated from 3 cm high etiolated 60 
seedlings of Sorghum bicolor (L.) Moench grown as 
described for seedUngs used for preparation of microsomes. 
The poly A* RNA is used for the construction of a Xgtll 
expression library and a XgtlO library. The construction of 
the Ubraries can be done according to the procedures 65 
described for example in Sambrook et al. Molecular cloning: 
A laboratory manual. Cold Spring Harbor Laboratory Press, 
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Cold Spring Harbor, N.Y., 1989 or can be ordered by 
commercial companies such as Strategene (La JoUa, Calif.). 
Antibodies obtained against cytochrome P450TYR are used 
to screen the expression libraries as described by Young et 
al, Procl. Natl. Acad. Sci USA 82:2583-2587. Antigen- 
antibody complexes are detected enzymatically with alka- 
line phosphatase-conjugated antibodies (Dakopatts). DNA 
from 4 positive plaques is prepared according to 
Grossberger, Nucleic Acid Research 15:6737, 1987. Inserts 
from X phages are subcloned into pBluescript II SK 
(Strategene). Comparison of the deduced amino acid 
sequence from one of the four inserts with the amino acid 
sequences obtained from protein sequencing of P4503~j^ 
shows that this clone is a partial cDNA clone for P450j-y^. 
The partial cDNA clone is used as a probe for a new screen 
of the XgtlO and Kgtll Ubraries. The insert sizes of 45 
positive clones are determined by southern blotting. Addi- 
tionally the 45 positive clones arc examined for hybridiza- 
tion with two different mixtures of oligonucleotides by 
southern blotting. The sequences of the oligonucleotide 
mixtures are based on the partial amino acid sequence data 
and specify a sequence near the N-terminal end (amino acids 
4 to 9) and a sequence near the C-terminal end (amino acids 
533—538). Oligonucleotide synthesis is carried out on a 
Cyclone Plus DNA Synthesizer. Sequencing of one clone 
derived from the XgtlO library showing the expected size 
and hybridizing with the two oligonucleotide mixtures 
shows that the clone is a full-length cDNA clone encoding 
cytochrome P450:^^. 

Oligonucleotide specifying amino acids 4 to 9 (MEVEAA) 
(SEQ ID NO: 17) 5'-ATG-GA[G,A]-GT[C,G,T,A]-GA 
[GA]-GC[CGTA]-GC-3' 
Oligonucleotide specifying amino acids 533 to 538 
(DFTMAT) (SEQ ID NO: 18) 5'-GA[C,T]-AC[C,G,T,A]- 
TT[C,T]-ATG-GC[C,G,TA]-AC-3' 

5.6. DNA sequencing 
DNA sequencing is carried out by the dideoxy chain 

method (Sanger et al, Proc. Natl. Acad. Sci. USA 
74:5463-5467, 1977) using [^^S]-dATP. T7 DNA poly- 
merase and deoxynucleotides are obtained from Pharmacia, 
dideoxynucleotides from Boehringer Mannheim. Sequenc- 
ing of the full-length cDNA clone is done partly by sub- 
cloning and partly by using synthetic oligonuecleotides as 
primers. The oligonucleotide primers can be ordered with 
commerical companies. 

5.7. Southern blotting 
XDNA isolated from the positive clones is digested with 

Eco Rl. The inserts are separated from >^DNA by electro- 
phoresis on a 0.7% agarose gel. After electrophoresis, DNA 
is capillary blotted onto a Zetaprobe membrane (Biorad) 
using 10 mM NaOH for the transfer. Hybridization is 
performed at 68° C. in 1.5x SSPE (270 mM NaCl, 15 mM 
Na^HPO^ pH 7.0, 1.5 mM EDTA, 1% sodium dodecyl 
sulphate) 10% dextransulphate, 0.5% skim milk and 0.1 
mg/ml salmon sperm DNA for 16 hours. When the partial 
cDNA clone is used as probe for hybridization it is labeled 
with [ct-^^P]dCIP using a random prime labelling kit 
(Amersham International pic). The oligonucleotide mix- 
tures are 5'end labeled according to Okkels et al. (Okkels et 
al, FEBS Letters 237:108-112, 1988). The filters are washed 
first in 2x SSC (0.9M NaCI, 0.09M trisodium citrate, 0.1% 
SDS) (SEQ ID NO: 24) at 47** C. for 15 min., then in a fresh 
solution of the same composition at 56** C. for 15 min. and 
finally in O.lx SSPE, 0.5% SDS for 30 minutes at 65° C. The 
presence of radioactively labelled hybridization bands on the 
filter is monitored by X-ray autoradiography. 
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5.8. Characterization of a full-length cDNA clone 
>uDNA isolated from the positive clones is digested with 

Eco RI. The insert is separated from XDNAby electrophore- 
sis on a 0.7% agarose gel and subcloned into the EcoRl site 
of the vector pBluescript SK (Strategene) contained within 
the sequence GCAGGAATTCCGG. The four last bases of 
this sequence are listed as the first four bases in SEQ ID NO: 

I, A clone comprising the described cDNA has been depos- 
ited with the Agricultural Research Culture Collection 
(NRRL), 1815 N. University Street Peoria, 111. 61604 U.S.A. 
under the accession number NRRL B-21168. 

The orientation of the insert in the vector is determined as 
the polylinker restriction site for Pst I being adjacent to the 
5 'end of the cytochrome P-450 sequence. The sequence of 
the insert is shown in SEQ ID NO: 1. The sequence 
comprises an open reading frame (ORE) starting at nucle- 
otide 188 and ending at nucleotide 1861 of SEQ ID NO: 1. 
It encodes a protein of 558 amino acids and a molecular 
mass of 61887 Da shown in SEQ ID NO: 2. The sequence 
comprises the sequences of SEQ ID NO: 3 to SEQ ID NO: 

II. The protein is not subject to post-translational modifi- 
cation at the N- and C-terminal ends except for the removal 
of the N-terminal methionine residue. The N-termninal 
region of cytochrome P-4507-i^, however, shows four motifs 
which in animals are known to target heme-thiolate proteins 
to the endoplasmatic reticulum. 

Searches for sequence similarity are made using the 
programmes BLAST and FASTAin the nucleotide sequence 
data bases provided by the EMBL. Pairwise comparisons of 
cytochrome P-450y^^ with other cytochrome P-450 
sequences are performed using the programme GAP of the 
Genetics Computer Group GCG sofware package. Multiple 
alignments were made using the GCG programme PILEUP. 

5.9. Expression of native cytochrome P-4507^ in E, coli 
Plasmid pCWOri+ (Gegner et al, Prod. Natl. Acad. Sci. 

USA 88:750-754, 1991) is used to express the wildtype 
cytochrome P-450j-j^ cDNA sequence as described by Bar- 
nes et al. Prod. Natl. Acad. Sci. USA 80:5597-5601, 1991. 
cDNA sequences are introduced into the expression plasmid 
using polymerase chain reaction (PCR) mutagenesis. A 
synthetic ohgonucleodde (TYROLlb) containing an amino 
acid -conserving and nucleotide modifying 5' cDNA 
sequence is used in conjunction with a downstream oligo- 
nucleotide (TYROL3) to ampUfy the N-terminal sequence 
between the ATG initiator codon (contained within an Ndel 
site) and a unique BamHI restriction site within the cyto- 
chrome P-4507^ sequence. A synthetic oligonucleotide 
(TYROL2) is used in conjunction with an oligonucleotide 
(l'yROL4) complementary to a unique PstI restriction site 
to introduce a Hindi II restriction site immediately down- 
stream of the TGA stop codon. The expression plasmid 
pCWtyr is constructed by simultaneous ligation of the 278 
basepair PCR Ndel/BamHI fragment, the 1257 basepair 
BamHI/PstI fragment of cytochrome P-4507^ and the 146 
basepair PCR Pstl/Hindlll fragment with the Ndel/Hindlll 
cleaved vector DNA. E. coli strain JM 109 transformed with 
plasmid pCWtyr is grown in LB/ampicilline medium at 37° 
C. Expression of cytochrome ¥-A50j^ is obtained by grow- 
ing the cells in a medium containing 1 mM isopropyl 
beta-D-thiogalactopyranoside (IPTG) and shifting the cells 
to growth at 28° C. al 125 rpm. E, coli produces a func- 
tionally active cytochrome P-450y^ enzyme which con- 
verts tyrosine into oxime. The analytical procedures are as in 
the reconstitution experiments described in section 5.3. 
above. The expressed cDNA clone encoding P-4507-j^ 
specifies the synthesis of a single cytochrome P-450 
enzyme. Since this enzyme catalyzes the conversion of 



J3,417 

18 

tyrosine all the way to p-hydroxyphenylacctaldehyde oxime, 
this unambigeously demonstrates that cytochrome P-4507^ 
is multifunctional, 

ITie following oUgonucleotides are used: 
5 TYROLlb (SEQ ID NO: 19) 

5'-CGG GAT CCA TAT GCT GCT GTT ATT AGC AGT 

TIT TCr GTC GTA-3' 
TYROL2 (SEQ ID NO: 20) 

5'-GAC CGG CCG AAG C7T TAA TTA GAT GGA GAT 
10 GGA-3' 

TYROL3 (SEQ ID NO: 21) 

5'- AGT GGA TCC AGC GGA ATG CCG GCT T-3' 
TYROL4 (SEQ ID NO: 22) 
5^-CGT CAT GCT CTT CGG AA-3' 

15 5.10. Expression of truncated and modified cytochrome 
P-450j^ in E. coli 

A modified cytocrome P-450y-y.^, in which the 35 
N-terminal amino acids are replaced by the nine N-terminal 
amino acids from bovine 17a hydroxylase is introduced into 

20 the expression vector pSP19glOL which can be obtained 
from Dr. Henry Barnes (La Jolla, Calif.). This plasmid 
contains the lac Z promoter fused with the known short 
leader sequence (glOL) of gene 10 from bacteriophage T^ 
(Olin et al, 1988). A construct containing the N-terminal 

25 amino acids from bovine 17a hydroxylase and a truncated 
form of the P-4507^ gene is designed using PCR mutagen- 
esis: Oligonucleotide TYROL Id (5*-CGG GAT CCA TAT 
GGC TCr GTT ATP AGC AGT TCT GTC GTA CCT 
GGC CCG-3'; SEQ ID NO 23) containing a 5' mutant cDNA 

30 sequence as well as a BamHI and Ndel restriction site is 
used together with oligonucleotide TYROL 3 comprising 
the sequence surrounding the unique BamHI restriction site 
downstream of the ATG start codon of the cDNA coding for 
P-4507-y5,j to amplify a modified N-terminal sequence of 

35 P-450yY/c- Th^ amplification product is cut with Ndel and 
BamHI restriction enzymes. To introduce a Hindlll site 
immediately downstream of the stop codon of the 
gene, oligonucleotides TYROL 2 and TYROL 4 are used in 
a polymerase chain reaction to obtain a C-terminal fragment 

40 of the P-4503^ gene comprising a Hindlll restriction site 
immediately downstream of the stop codon. The ampfifica- 
tion product is cut with PstI and Hindlll restriction enzymes. 
The complete expression plasmid is constructed by simul- 
taneous ligation of the N-terminal Ndel/BamHI fragment, 

45 the BamHI/PstI fragment of the P-4507^ gene and the 
C-terminal Pstl/Hindlll fragment into Ndel/Hindlll cleaved 
pSP19gl0L vector DNA. The expression vector obtained is 
transformed into E. coli strain JM109, Transformed E. coli 
produce 300 nmol to 500 nmol cytochrome P-450y^ per 

50 liter cell culture upon growth at 28° C. in the presence of 1 
mM isopropyl -p-D-thiogalactopyranoside and at 125 rpm. 
Expression levels as high as 900 nmol per liter, equivalent 
to 55 mg P-4507y7j per liter, have been obtained. 

Administration of tyrosine to the cell culture results in the 

55 production of p-hydroxyphenylacetaldehyde oxime, 
whereas a cell culture transformed with pSP19gl0L alone 
does not produce the oxime. 

Reconstitution experiments with E. cole -expressed cyto- 
chrome P-450yy7j and sorghum NADPH cytochorme P450 

60 reductase in dilaurylpohsphatidylcholine micelles is per- 
formed as described in section 5.3. above. Tiu-nover rates of 
349 nmol oxime per nmol P-4507^^ per minute can be 
demonstrated, which is equivalent to the values obtained 
with sorghum P-450j^. 

65 Purified sorghum cytochrome P-4507^j^ can be shown to 
form type I substrate binding spectra with tyrosine and 
N-hydroxytyrosine (compare section 5.1.). Using P-450j^ 
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expressed in E. coli it can be shown that in addition to 
tyrosine and N-hydroxytyrosine P-450y^ is also able to 
form a type Ispectnim with p-hydroxyphenylacetaldehyde 
oxime, 2-nitro-(p-hydroxyphenyl)ethane, 
p-hydroxyphenylacetonitrile as well as phenylalanine. The 
molar extinction coefiBcient E420-390 tyrosine and 
N-hydroxytyrosine as genuine substrates of P-450y^ are 
75.8 cm~^mM"^ and 64.6 cm"^mM"^, respectively, whereas 
the extinction coeflScients of the other compounds vary from 
20-40 cm-^mM-\ 

Reconstitution experiments using phenylalanine as sub- 
strate do not result in the production of the corresponding 
oxime. This indicates, that cytochrome P-4507^ has a 
narrow substrate specificity with respect to its enzymatic 
activity although it is able to bind many tyrosine analogues, 

Administration of ^'^C-tyrosine directly to E. coli cells 
expressing cytochrome P-450j^^ results in the production of 
p-hydroxyphenylacetaldehyde oxime, indicating that E, coli 
is able to provide the reducing equivalents for cytochrome 
P-450:™. 

ITie following oligonucleotides are used: 
TYROLld (SEQ ID NO: 23) 

5'-CGG GAT CCA TAT GGC TCT GTT ATT AGC AGT TT 

TCT GTC GTA OCT GGC CCG-3' 
TYROL2 (SEQ ID NO: 20) 

5'-GAC CGG CCG AAG CTT TAA TTA GAT GGA GAT 

GGA-3' 
TYROL3 (SEQ ID NO: 21) 

S'-AGT GGATCC AGC GGA ATG CCG GCT T-3' 
TYROL4 (SEQ ID NO: 22) 
5'-CGT CAT GCT CTT CGG AA-3' 

Example 6 
Characterization of P-450^^ 

6.1. Substrate binding spectra of ^ASO^^ 

Similiar experiments as reported in section 5.1 are carried 35 
out using isolated cytochrome P-450c>^ with 
p -hyd roxypheny lacetaldoxime and 
p-hydroxyphenylacetonitrile as substrate. Cytochrome 
P-450^^ is found to be multifunctional as P-4507^. Isolated 
cytochrome P-450o^ resembles the cytochrome P-450 40 
reported to convert oximes to nitrites in rat liver microsomes 
(DeMaster et al, J. Org. Chem. 5074-5075, 1992). 

6.2. Molecular weight and Amino acid sequence data 

The molecular weight of P-450c»^ as determined by SDS- 
PAGE is 51 kD. Amino acid sequences are obtained by 45 
automated Ed man degradation. The internal polypeptides 
are obtained by trypsin digestion of the purified protein and 
subsequent separation of peptides using reverse phase 
HPLC. 

N-terminal sequence: 50 
"MDLADIPKQQRLMAGNALW" (SEQ ID NO: 12) 
Additional peptide sequences: 
"ARLAEIFATII- (SEQ ID NO: 13) 
-EDFTVTTK- (SEQ ID NO: 14) 

-QYAALGSVFTVPII-- (SEQ ID NO: 15) 55 
"XXPFPI- (SEQ ID NO: 16) 

6.3. Reconstitution of cytochrome P-450<3.^ activity: 

The reconstitution assay used for cytochrome ^-ASO^x is 
similiar to that used for cytochrome ^~A50j^j^ described in 
section 5.3. A typical assay contains 10 //I DLPC (10 60 
mg/ml); 50 fA cytochrome V-A50^^ (24 ^ag/mt); 50 /A 
NAD PH -cytochrome P-450 oxidoreductase; 20 ^1 of either 
10 mM p-hydroxyoheny lacetonitrile or 
p-hydroxyphenylacetaldehyde oxime; 10/^1 NADPH (25 
mg/ml); and 60 jul potassium phosphate buffer (pH 7.9). 

The reconstitution assay demonstrates that cytochrome 
P-450o;c converts p-hydroxyphenylacetaldehyde oxime to 



p-hydroxymandelonitrile. The analytical procedures used 

are those described for cytochrome P-450yy3,. 

6.4. Inhibitory effect of antibodies against cytochrome 

P-450o^ 

The effect of antibodies raised against cytochrome 
P-450q,^ on the biosynthetic activity is measured as the 
decrease in cyanide production upon incubation of the 
sorghum microsomes with p-hyd roxypheny lacelaldehyde 
oxime and p-hydroxybenzytcyanide as substrates. The com- 
position of the 150 //I total volume reaction mixtures is: 
microsomes containing 33 //g protein, 1.5 jumoX substrate, 
7.5 //mol tricine pH 8.0, 0.33 /miol NADPH, 0-255 fi% 
antibodies and 0-255 jWg reference immunoglobulin. The 
total amount of immunoglobulin in the assay is in each 
sample adjusted to 225 //g using purified immunoglobulin 
from a nonimmunized rabbit. The antibodies are preincu- 
bated with the microsomes for 15 minute at 30° C. before 
substrate and NADPH are added. Subsequently the reaction 
is incubated at 30° C. for 30 minutes. Cyanide is determined 
by the Konig reaction (Konig, Z. Angew. Chem. 18:115, 
1905) using methodology described in HaDcier and Moller, 
Plant Physiol. 90:1552-1559, 1989. A value oi A^^^_^q^^1 .5 
corresponds to 10 nmoles cyanide. Protein concentration 
was determined using the method of Bradford (Bradford, 
Anal. Biochem. 72:248-254, 1976). A typical result of such 
an inhibition experiment is shown in Table E. 



TABLE E 


j«g antibody 


0 15 30 60 


120 


225 




Substrate p-hydroxyphenylacetaldehyde oxime 




inhibLLLon 


1,00 0.96 0,92 0.87 
0% 4% 8% 13% 
Substrate p-hydroxybenzylcyanide 


0,88 
12% 


0,66 
34% 


^680-585 

inhibition 


1,07 1,03 1,03 0,88 
0% 4% 4% 18% 


0,87 
18% 


0,70 
35% 



The data show, that the antibody inhibits the reactions to 
the same extent whichever substrate is added to the microso- 
mal preparation. 

Example 7 

Induction of glucosinolate production in Tropaeolum majus. 

Seeds of Tropaeolum majus L. cv Empress of India 
(Dansk Havefroforsyning, Kolding, DK) are allowed to 
imbibe and germinate in complete darkness for one week at 
25° C. In vivo biosynthesis experiments are performed 
wherein 1 //Ci of the tracer ^"^C-labelled phenylalanine is 
administered to excised dark -grown seedlings for 24 hours 
followed by boiling of the plant material in 90% methanol 
and analysis of the extracts by HPLC as described by 
Lykkesfeldt and Moller, 1993. Prior to administration of the 
tracer to the excised seedling or leaf, the intact plant is 
subjected to a potential inducer for 24 hours. Administration 
of 10 mM phenylalanine or 2% ethanol to the vermiculate in 
which the etiolated seedlings are grown results in a threefold 
increase in glucosinolate production as compared to control 
experiments with water. Spraying with 100 //M jasmonic 
acid followed by incubation for 24 hours results in a fivefold 
induction in etiolated seedlings and green leaves. 

Example 8 

Preparation of biosynthetically active microsomes from 
glucosinolate-producing plant meterial 

The biosynthetic pathways of glucosinolates, and cyano- 
genic glucosides share homology by having amino acids as 
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precursors and oximes as intermediates. The assignment of 
amino acids and oximes as precursors and intermediates in 
the glucosinolate biosynthetic pathway is based on in vivo 
experiments demonstrating that these compounds are effi- 
cient precursors for glucosinolates. In vitro biosynthetic 
studies have hitherto not been possible due to the detrimen- 
tal effect of the degradation products of glucosinolates on 
enzyme activities. The degradation products are formed 
upon disruption of the cellular structure. In the disrupted 
tissue, the glucosinolate -degrading enzyme myrosinase gets 
in contact with the glucosinolates resulting in the generation 
of isothiocyanates inactivating the enzymes. We demon- 
strate that microsomal preparations isolated form one week 
old plants of either Sinapis alba or Trapaeolum ma jus are 
able to convert tyrosine and phenylalanine, respectively, to 
the corresponding oximes. The enzymatically active 
microsomal preparations are obtained by using an isolation 
buffer fortified with 100 mM ascorbic acid known to inhibit 
the activity of myrosinase and by inducing the 
glucosinolate -producing enzyme system prior to the prepa- 
ration of microsomes. The glucosinolate-producing enzyme 
systems are induced by taking 7-day s-old dark-grown Sina- 
pis plants or 3-4 weeks old light-grown Tropaeolum plants 
and placing them in the light for 3 days. During this three 
day period, the young plants are sprayed with 50 
jasmonic acid once a day. After 3 days of induction, the 
plants are harvested and microsomes are prepared as 
described in section 5.1, except that the homogenisation 
buffer consists of 250 mM Tricine pH 7.9, 250 mM sucrose, 
50 mM sodium bisulfite, 100 mM ascrobic acid, 4 mM DDT, 
2 mM EDTA, 1 mM PMSF, and 5 mg/ml BSA. The 
microsomal preparation is dialysed against homogenization 
buffer for 1 hour, followed by dialysis against 50 mM 
Tricine pH 7.9 and 2 mM DTT for another hour. 

Example 9 

In vitro biosynthesis of oxime by extracts from 
glucosinolate -containing plants 

The microsomal reaction mixture consists of 80 fA 
microsomes (10 mg protein per ml), 10 /il 
C-phenylalanine (0.5 //Ci, 464 mCi/mmol, Amersham) or 
^'*C-tyrosine (0.5 //Ci, 450 mCi/mmol, Amersham) and 10 //I 
NADPH (75 mg/ml). llie reaction mixtures are incubated 
for 1 hour at 37° C. At the end of the incubation period, the 
reaction mixtures are extracted with 1500 ^\ ethyl acetate. 
The ethyl acetate phase is evaporated to dryness, redissolved 
in a small volume and analyzed. The production of oximes 
in the microsomal reaction mixtures can be demonstrated by 
thin layer chromatography as well as by HPLC analysis as 
described in section 5.3. 

Example 10 50 
Involvement of cytochrome P450-dependent monooxygena- 
ses in the glucosinolate pathway 

Based on the similarity between the first part of the 
biosynthetic pathways of glucosinolates and cyanogenic 
glucosides, it was anticipated that the conversion of amino 55 
acid to oxime in the glucosinolate pathway is catalyzed by 
a multifunctional cytochrome P450 monooxygenase 
homologous to P450^ in the cyanogenic glucoside pathway. 
In vivo experiments, where radioactively labelled phenyla- 
lanine is administered to etiolated tropaeolum seedlings in 60 
the presence and absence of 1 mM of the cytochrome P450 
inhibitors eniLketoconazoI and tetcyclacis demonstrate that 
cytochrome P450 inhibitors cause a reduction of glucosino- 
late without causing a reduction in the uptake of phenyla- 
lanine as measured by ethanol extraction of the plant mate- 65 
rial. This indicates that the biosynthesis of glucosinolates is 
dependent on cytochrome P450, 



Direct demonstration of the involvment of cytochrome 
P450 in glucosinolate biosynthesis can be obtained using the 
in vitro microsomal enzyme system from tropaeolum to 
demonstrate photoreversible carbon monoxide inhibition of 
oxime production. The microsomal reaction mixtures are 
incubated using different experimental conditions. The reac- 
tion mixtures are analyzed by HPLC. 



Experimental condition 


% inhibition of oxime production 


O2 without light 


0 


O2 with light 


11 


CO/O2 without light 


65 


CO/O2 with light 


23 



40 



45 



The possibility to reactivate the microsomal enzyme 
system upon irradiation with 450 nm light shows, that the 
conversion of phenylalanine to the corresponding oxime in 
the biosynthetic pathway of glucosinolate is dependent on 
cytochrome P450. 

Example 11 
Toxicity of cyanogenic glycosides for insects. 

Insects or insect larvae are fed on a diet containing added 
cyanogenic glycoside, a diet containing added cyanogenic 
glycoside and callus, or a diet supplemented with the super- 
natant of callus ground-up in the presence of the cyanogenic 
glycoside. Mortality is compared to the mortality of insects 
or insect larvae fed on the diet only. 

Example 12 

Activity of Amygdalin on larval mortality of Western Com 
Root Worm (WCRW): 

WCRW larvae are fed on a diet with added amygdalin, a 
diet with added amygdalin and Black Mexican Sweet (BMS) 
callus or on a diet supplemented with the supernatant of 
BMS-caUus ground-up in the presence of amygdalin. Larval 
mortality is compared to the mortality of larvae fed on the 
diet only. 

The results show that amygdalin is lethal in the presence 
of BMS-callus with an LC50 of ~1 mg/ml, and that it is lethal 
at 2 mg/ml in the absence of BMS-callus. There is signifi- 
cantly less lethality at amygdalin concentrations of less than 
1 mg/ml when BMS-callus is absent. 

Example 13 

Activity of Dhurrin on larval mortality of Western Com 
Root Worm (WCRW): 

The activity of dhurrin on larval mortality of WCRW was 
determined as described for amygdalin in example 12. 

The results show that the LC50 of dhiurrin is 368 /<g/ml 
with 95% confidence limits of 0.28-0.48 ^ag/ml. The slope of 
the regression line is 2.5. 

Example 14 

Transfection of maize by direct Bombarding of Immature 
Zygotic Embryos and Isolation of Transformed CaUus with 
the Use of Phosphinothricin as a selection agent. 

Immature embryos are obtained approximately 14 days 
after self-poUination. The immature zygotic embryos are 
divided among different target plates containing medium 
capable of inducing and supporting embryogenic callus 
formation at 36 immature embryos per plate. The immature 
zygotic embryos are bombarded with plasmids encoding a 
cytochrome P-450 monooxygenase and a chimeric gene 
coding for resistance to phosphinothricin using the PDS- 
1000/He device from DuPont The plasmids are precipitated 
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onto 1 /im gold particles essentially according Du Font's 
procedure. Each target plate is shot one time with the 
plasmid and gold preparation and phosphinothricin is used 
to select transformed cells in vitro. Selection is applied at 3 
mg/i one day after bombardment and maintained for a total 
of 12 weeks. The embryogenic callus so obtained is regen- 
erated in the absence of the selection agent phosphinothri- 
cin. The regenerated plants are tested for their resistance to 
insects, acarids or nematodes. 

Although the foregoing invention has been described in 
some detail by way of illustration and example for purposes 
of clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the 
scope of the appended claims. 

TABLE A 

PLANT CLASvSlFICATION ACCXDRDINCt TO USE 
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Mo no cot 



Avena nuda (chinensis) 

A. saliva 

Eleusine coracan 

Eragrostis tef 

Fagopyrum esculentum 

F. tataricum 

Hordeum distichum 

H. vulgare 

Oryza saliva 

Panicum ilalicium 

P. miliaceum 

Penniseium glaucum 

P. spicatum {americatmm) 

Secale cereale 

Sorghum vulgare 

X Triticosecale 

Trilicum aestivum 

T. dicoccum 

T. durum 

T. mortococcum 

Zea mays 

Dicot 

Amarartihus partictdatus 
Fagopyrum esculenlum 

F. tataricum 
PROTEIN CROPS 

Dicot 

Arachis hypogea 
Cafanus indicus 
Cicer arietinum ■ 
DoUchos lablab 
Glycine gracilis 

G. max 

G. ussuriensLs 
Lathyrus salivas 
Lens culinaris 
Mucuna pruriens 

Phaseoliis acutifoUus 

P. aureus 

P. luttatus 

P coccineus 

(multiflorus') 

P. mungo 

P. vulgaris 

Vicia faba 
Vigna angularis 

V sesquipedalis 

V sinensis 



Chines naked oat 
Common oats 
African millet 
Tef grass 
Buckwheat 
Rye buckwheat 
Two- row barley 
Barley 
Rice 

Italian millet 
Broomcorn millet 
Spiked millet 
Perl millet 
Rye 

Grain sorghums 
Triticale 
Common wheat 
Emmer 

Abyssinian hard wheat 
Einkom wheat 
Corn, sweet com 



Rispenfuchsschwanz 
Buchweizen 



Groundnut, peanut 
Pigeon pea 
Chickpea 
Hyacinth bean 
Manchurian Soya 
Soyabean 
Wild soya 
Grass pea 
Lentil 

Cowitch, Florida velvet 
bean 

Tepary bean 
Mung» green gram 
Lima bean, Sieva 
Scarlet runner bean 

Black gram 

French, common, kidney or 
dwarf bean 

Horse bean, broad bean 
Adzuki bean 

Asparagus (yard-long bean) 
Cowpea 
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lABLE A-continued 



PLANT CLASSIFICATION ACCORDING TO USE 



FRurr CROPS 



Dicot 



55 



Amygdalus communis 
Ananas comosus 
Artocarpus communis 
Carica papaya 
Cilrullus vulgaris 
Citrus grandis 
C. medica 
C. nobilis 
C. reticulata 
C. sinensis ^ 
Cydonia oblonga 
Diospyros fcaki 
Ficus carica 
Fragaria chiloensis 
F. virginiana 
Litchi chinensis 
Malus asiatica 
M. pumila 
Mangifera indica 
Morus rubra 
Musa cavendishii 
M. paradisiaca 
Passiftora edulis 

P. ligidaris 

Persea americana 

Phoenix dactylifera 

Prunus armeniaca 

P. avium 

P. cerasifera 

(divaricata) 

P. cerasus 

P. domestica 

P. maheleb 

P. persica 

P. pseudocerasus 

P salicinia 

P serotina 

Psidium guajava 

Purtica granatum 

Pyrus communis 

P. ussuriensis 

Ribes grossularia 

R. nigrum 

R. rubrum 

Rubus idaeus 

R. strigosus 

Tamarindus indica 

Vaccinium anguslifolium 

V. ashei 

V. corymbosum 

V. mynilloides 

V. oxycoccos 

Vibumum trilobum 

Vitris labrusca 

V. vinifera 

VEGETABLES AND TUBERS 
Monocot 
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Allium ascalonicum 

A. cepa 

A. chinense 

A. fislulosum 

A. porrum 

A. sativum 

A, schoenoprasum 

Asparagus officinalis 

Zea mays 

Dicot 

Amoracia lapaihifolia 
Apium graved ens 



Almond 

Pineapple 

Breadfruit 

Papaya 

Watermelon 

Pummelo 

Citron, lemon 

Tangerine 

Mandarin 

Orange 

Quince 

Japanese persimmon 
Fig 

Wild strawberry 

Strawberry 

Litchi 

Chines apple 

Appple 

Mango 

Red mulberry 

Banana 

Banana 

Passion fruit, purple 
granadilla 
Passion flower 
Avocado pear 
Date palm 
Apricot 

Sweet cherry, mazzard 
Cherry plum 

Cherry 

European plum or prune 
Maheleb cherry 
Peach and nectarine 
Cherry 

Japanese peach 
Wild black cherry 
Guava 

Pomegranate 
Pear 

Chinese pear 

Gooseberry 

Black currant 

Red and white currant 

European raspberry 

American raspberry 

Tamarind 

Sugarberry 

Rabbitcyc blueberry 

Highbush blueberry 

Canada blueberry 

Cranberry 

American cranberry bush 

Fox grape 

Grape 



Shallot, breen onion 

Onion 

Onion 

Welch onion 
Leek 
Garlic 
Chives 

Asparagus (var. altilis) 
sweet corn 



Horseradish 
Celery 
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TABLE A-continued 



PLANT CLASSinCATION ACCORDING TO USE 



PX.ANT CLASSIFICATION ACCORDING TO USE 



Arabidopsis thaliana 
Beta vulgaris 

Brassica alboglabra 
B. campestris 
B. carinata 
B. cemea 
B. chinensis 

B. hirta 
B. JuTtcea 

B. kaber 

B. tmpobrassica 

B. naptis 

B. nigra 

B. oleracea 



B. pekinensis 

B. rapa 

Cajartus cajart (indicus) 
Canavalia ensiformis 
Canna edulis 
Capsicum annuum 

C. chinense 
C. frutescens 
C. pendulum 
C. pubescens 
Cichorium endivia 
C. intybus 

Colocasia antiquorum 
Crambe maritima 
Cucumis melo 
C. sativus 
Cucurbita ficifolia 
C. foetidissima 
C. maxima 
C. moschata 

C, pepo 

Cyrtara scolymus 
Daucus carota 
Dioscorea alata 

D. batatas 

D. cavetmensis 
Eruca sativa Mill. 

Jpomea batatas 
Lactuca sativa 
Lepidium sativum 
Lycopersican cerasiforme 
L. esculentum 
Manihot esculenta 
Nasturtium officinale 
Pastirtaca sative 
Petroselinum crispum 
(sativum) 

Physalis peruviana 
Pisum sativum 
Raphanus sativus 
Rheum officinale 
R. rhaponticum 
Scorzonera hispanica 
Sechium edule 
Solanum andigenum 
S. melongena 
S. muricatum 
S. phureja 
S. tuberosum 
Psinacia oleracea 



Common wall cress 
Sugar, mangold or garden 
beet 

Chinese kale 

Turnip rape 

Ambyssian mustard 

Karashina 

Chinese mustard or 

pak-choi 

White mustard 

Pai, brown mustard, Indian 

mustard 

Charlock 

Swede or rutabaga 
Rape, oil rape, kale 
Black mustard 
Cole, kale, coUards, 
brussels sprouts, 
cauliflower, cabbage, 
kohlrabi, broccoli 
Chines cabbage or celery 
cabbage 
Tbrnip 
Pigeon pea 
Jack bean 
Edible canna 

Common cultivated pepper 
Pepper 

Cayenne pepper 

Pepper 

Pepper 

Endive 

Chicory 

Taro 

Sea kale 

Melon, cantaloupe 

Cucumber 

Malabar gourd 

Calabazilia, bufifalo gourd 

Pumpkin 

Winter pumpkin 

Summer squash, vegetable 

marrow 

Globe artochoke 

Carrot 

Yam 

Chines yam 
Attoto yam 

Rocket salad, rocket or 

roquette 

Sweet potato 

Lettuce 

Garden cress 

Cherry tomato 

Tomato 

Manioc, cassava 
Water cress 
Parsnip 
Parsley 

Ground cherry 

Pea 

Radish 

Rhubarb 

English rhabarb 

Black salsify 

Chayote 

Andean potato 

Eggplant 

Pepino 

Potato 

Common potato 
Spinach 



5 NUTS 



Dicot 



Anacardium occidentale 
Arachis hypogaea 
10 Carya illinoinensis 
C. ovata 
Castanea sativa 
Covos nucifera 
Corylus americana 
C. avellana 
J 5 Juglans nigra 
J. regia 
J, sinensis 
Litchi chinensis 
Macadamia integrifolia 
Pislacia vera 
Prunus amygdalus 
OIL CROPS 

Monocot 
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Zea mays 
Dicot 



25 

Aleurites cordata 

A. moluccana (triloba) 
Arachis hypogea 
brassica campestris 

B. napus 

30 Cannabis sativa 

Carthami*s tinctorius 
Cocas nucifera 
Elaeis guineensis 
Glycine gracilus 
G. max 

35 G. ussuriensis 

Cossypium hirsutum 
HeliarUhus annus 
Linum usitatissimum 
Olea europaea 
Papaver somniferum 
Q Ricinus communis 
Sesamum indicum 
SUGAR CROPS 

Monocot 



55 



60 



Saccharum officinarum 
45 (officinarum x 

spontaneum) 

S. robustum 

S. sinense 

S. spontaneum 

Sorghum dochna 
50 Dicot 

Acer saccharum 
Beta vulgaris 

FORAGE AND TURF GRASSES 



Monocot 



Agropyron cristatum 
A. dese riorum 
A. elongatum 
A. intermedium 
A. smithii 
A. spicatum 
A. trachycaulum 

A. trichophorum 
Alopecurtis pratensis 
Andropogon gerardi 
Arrhenatherum elatius 

^5 Bothrichloa barbinodis 

B. ischaemum 



Cashew 

Peanut 

Pecan 

Shagbark hickory 
Chestnut 
coconut palm 
American hazel, filbert 
European hazel, cobnut 
Black walnut 
English walnut 
Walnut 
Litchi 

Queensland nut 
Pistachio nut 
Almond 



Corn 



Tung, China wood oil , 

Candlenut 

Ground nut, penut 

Rapeseed oil, canola oil 

Rapeseed oil, canona oil 

Hampseed oil 

Safflower oil 

Coconut palm 

Oil palm 

Manch, soya 

Soybean 

Wild soya 

Cottonseed oil 

Sunflower 

Rax 

Olive 

Poppy seed 
Castor bean 
Sesame 



Sugarcane 



Sugarcane 
Kans grass 

Sorgo syrup, sugar sorghum 



Sugar maple 

Sugar or mangold beet 



Crested wheatgrass 
Crested wheatgrass 
Tall wheatgrass 
Intermediate wheatgrass 
Western wheatgrass 
Blue bunch wheatgrass 
Slender wheatgrass 
Pubescen wheatgrass 
Meadow foxtail 
Big bluestem 
Tall oat grass 
Cane blestem 
King ranch bluestem 
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TABLE A-continued 



TABLE A-continued 



PLANT CLASSinCATION ACCORDING TO USE 



PLANT CLASSinCATION ACCORDING TO USE 



B. saccharoides 
Boutelotta curipendula 
B. eriopoda 
B. gracilis 
Bromns erectus 
B. inermis 
B. riparius 
Cenchrus ciliaris 
Chloris gayana 
Cymhopogon nardus 
Cynodon dactylon 
Dactylis glomerata 
Dichanthitwt annulatum 
D. aristatum 
D. sericetmt 
Digitaria decumbens 

D. smutsii 
Elymus angustus 

E. jimceus 
EragrostLs curvula 
Festuca arundinacea 

F. ovina 

F. pratensis 
F. rubra 

Lolium muUiflorum 
L. pererme 
Panicum maximum 
P. purpurascens 
P. virgatum 
Paspalum dilatatum 

P. notatum 

Permisetum clandestinum 
P. purpureum 
Phalaris arundinacea 
Phleum bertolinii 
P. pretense 
Poa fendleriana 
P. nemoralis 
P. pratensis 
Setaria sphacelata 
Sorghastrum nutans 
Sorghum halepense 
S. sudanense 
Sorghum vulgare 
FORAGE LEGUMES 

Dicot 

Coronilla varia 

Crotalaria juncea 

Lespedeza stipulacea 

L. striata 

L. sericea 

Lotus comiculatus 

L. uliginosus 

Lupinus albus 

L. angustifolius 

L. luteus 

L. mutabilis 

Medicago arabica 

M. arborea 

M. falcata 

M. hispida 

M. sativa 

M. tribuloides 

Melilotus albus 

M. officinalis 

Onobrychis viciifolia 

Omithopus sativus 

Pueraria thunbergiana 

Trifolium alexandrinum 

T augustifolium 

T. diffusion 

T. hybridum 

T. incamatum 

7! ingrescens 



Silver blues tern 
Side oats grama 
Black grama 
Blue grama 
Upright brome 
Smooth brome 
Meadow brome 
Bufifcl grass 
Rhodes grass 
Citronella grass 
Bermuda grass 
Cocksfoot 
FCleberg bluestcm 
Anglcton bluestcm 
Silky blues tem 
Pangola grass 

Altai wild rye 
Russian wild rye 
Weeping love grass 
Tall fescue 
Sheeps fescue 
Meadow fescue 
Red fescue 
Italian ryegrass 
Perennial ryegrass 
Guinea grass 
Para grass 
Switchgrass 

Dallis grass, large water 
grass 

Bahia grass 
Kikuyu grass 
Dry napier grass 
Reed canary grass 
Timothy 
Timothy 
Mutton grass 
Wood meadow grass 
Kentucky bluegrass 
Rhodesian timothy 
Indian grass 
Johnson grass 
Sudan grass 
Great millet 



Crown vetch 
Sun hemp 
Korean lespedeza 
Common lespedeza 

Birdsfoot trefoil 

Wolf bean, white lupin 
Blue lupin 

European yellow lupin 
South American lupin 
Spotted burr-clover 
Tree alfalfa 
Yellow lucerne 
California burr-clover 
Alfalfa 
Barrel medic 
White sweet clover 
Yellow sweet clover 
Sainfoin 
Serradella 
Kudzu vine 
Egyptian clover 
Fineleaf clover 
Rose clover 
Alsike clover 
Crimson clover 
Ball clover 
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T. pratense 

T. repens 

T. resupinatum 

T. subterraneum 

Trigonella foenumgraecum 

Vicia sative 

V villosa 

V atropurpurea 
V. angustifolia 

V dasycarpa 
v. ervilia 

V, pannonica 

V calcarata 

FIBER PLANTS AND WOODY PLANTS 
Monocot 

Bambusa vulgaris 
Dicot 



Agave sisalana 
Boehmeria nivea 
Cannabis indica 
C. sativa 
Ceiba pentandra 
25 Corchorus mucronata 
{striata) 

Gossypium arboreum 
G. barbadense 
G. herbaceum 
G. hirsutum 
30 G. [OOlbJnanking 
Linum angustifolium 
L. usitatissimum 
Musa textiles 
DRUG CROPS 

35 P^cot 



Angelica archangelica 

Chrysanthemum 

cinerariifolium 

Camellia sinensis 

C. coccineum 

Coffea arabica 

C. canephora 

Cola acuminata 

Nicotiana rustica 

N. tabacum 

Papaver dubium 

P. somniferum 

Theobroma cacao 

SPICES AND FLAVORINGS 



45 



Monocot 

50 Vanilla fragrans 
Dicot 

Artemisa dracunculus 
Cinnamomum zeylanicum 
Hibiscus esculentus 
55 Salvia officinalis 
Thymus vulgaris 
Pimpinella aniswn 
Mentha . arvensis 
M. piperita 
M. viridis 

Coriandrum sativum 



60 



Red clover 
White clover 
Persian clover 
Subterranean clover 
Fenugreek 
Common vetch 
Hairy vetch 
Purple vetch 
Narrowleaf vetch 
Wooly pod vetch 
Monantha (bitter) vetch 
Hungarian vetch 
Bard vetch 



Bamboo 



Sisal hemp 
Rhea fiber, ramie 
Hemp 
Hemp 

Silk cotton tree, kapok tree 
Hemp 

Tree cotton 
Egyptian cotton 
Cotton 

Upland cotton 
Oriental cotton 
Wild flax 
Flax 

Manila hemp, abaca 



Angelica 

Palm pyrethrum 

Chinese tea 

Pyrethrum 

Coffee 

Ouillow coffee 
Kola nut 
Tobacco 
Tobacco 
Poppy 

Ojiium poppy 
cocoa 



Tarragon 

Cinnamon tree 

Okra 

Sage 

Thyme 

Anise 

Menthol 

Peppermint 

Spearmint 

Coriander 



Table B: REPRESENTATIVE PLANT PESTS 
Coleoptera: 

Diabrotica, Melanotus, Agriotes, Limonius, Dalopius, 
Eleodes, Chaetocnema, Macrodactylus, Sphenophorus, 
Sitophilus, Lisorhoptrus, Oulema, Rhyzopertha, 
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Prostephanus, Phyllophage, Cyclocephala, Popillia, 
Anthonomus, Zabrotes, Leptinotarsa 
Lepidoptera: 

Heliothis, Ostrinia, Diatraea, Elasmopalpus, Papaipema, 
Agrotis, Loxagrolis, Euxoa, Peridroma saucia, 
Chorizagrotis, Spodoptera, Pseudaletia, Chilo, Busseola, 
Sesamia, Eldana, Maliarpha, Scirpophaga, Duataea, 
Rupela, Sitotroga cerealella, Sitroga, Plodia 
interpunctella, Crambus, Mythimna, Nola, Pectinophora, 
Acontia, Trichoplusia, Anticarsia, Pseudoplusia, 
Manduca, Leptinotarsa, Lema 

Thysanoptera: 

Frankliniella, Anaphothrips, Hercothrips, Stenothrips 
Homoptera: 

Dalbulus, Cicadulina, Rhopalosiphum, Melanaphis, 
Anuraphis, Prosapia, Nilaparvata, Sogatella, Laodelphax, 
Sogatodes, Nephotettix, Reciian, Cofana, Empoasca, 
Poophilus, Schizaphis, Sipha, Paratrioza, Empoasca, 
Ophilia Scleroracus, Macrosteles, Circulifer, 
Aceratagallia, Agallia, Myzus, Macrosiphum, Aphis 

Diptera: 

Delia platura, Euxesta, Diopsis, Atherigona, Hydrellia, 

Orseolia, Chironomus, Contarinia 
Orthoptera: 

Melanoplus, Schistocerca, Sphenarium, Aneolamia 
Isoptera: 

Microtermes, Macrotemes, AJlodontermes, Odonlolermes 
Heleroptera: 

Nezara, Acrosternum, Euschistus, Blissus 
Acarina: 

Tetranychus, Paratetranychus, Oligonychus 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(iii) NUMBER OF SEQUENCES s 24 

(2) INFORMATION FOR SEQ ID NO: 1: 
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-continued 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 214 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sorghum bicolor 

(vii) IMMEDIATE SOURCE: 

(B) CLONE; P-45 0-Tyr 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

CCGGCTAGCT AGCTCATCGG GTGATCGATC AGTGAGCTCT CTCTTTGGCC TAGCTAGCTG 6 0 

CTAGCAGTGC AGGTAGCCAA TCAAAGCAGA AGAACTCGAT CGATCGATCA TCACGATCGC 12 0 

TGCTAGCTAG CTAGCTGCTC GCTCTCACAC TAGCTACGTG TTTTTGTTAA TTTGATATAT 18 0 

ATATATAATG GCGACAATGG AGGTAGAGGC CGCGGCCGCC ACGGTGCTGG CCGCGCCCTT 24 0 

GCTGTCCTCC TCCGCGATCC TCAAACTGCT GCTATTCGTA GTGACGCTCT CGTACCTGGC 300 

CCGAGCCCTG AGGCGGCCAC GCAAAAGCAC CACCAAGTGC AGCAGCACAA CGTGCGCCTC 36 0 

GCCCCCGGCC GGCGTTGGCA ACCCGCCGCT CCCACCGGGT CCCGTGCCGT GGCCCGTCGT 42 0 

CGGCAACCTG CCGGAGATGC TGCTGAACAA GCCGGCATTC CGCTGGATCC ACCAGATGAT 48 0 

GCGCGAGATG GGCACGGACA TCGCCTGCGT CAAGCTTGGC GGCGTCCACG TCGTGTCCAT 54 0 

CACCTGCCCG GAGATCGCGC GGGAGGTGCT CCGGAAGCAG GACGCCAACT TCATATCCCG 60 0 

CCCGCTCACC TTCGCCTCCG AGACGTTCAG CGGCGGGTAC CGGAACGCCG TGCTCTCGCC 660 

CTACGGCGAC CAGTGGAAGA AGATGCGCCG CGTCCTCACC TCCGAGATCA TCTGCCCGTC 72 0 

CCGCCACGCC TGGCTCCACG ACAAGCGCAC CGACGAGGCC GACAACCTCA CCCGCTACGT 78 0 

CTACAACCTC GCCACCAAAG CCGCCACCGG CGACGTCGCC GTCGACGTCA GGCACGTCGC 84 0 

TCGTCACTAT TGCGGCAACG TTATCCGCCG CCTCATGTTC AACAGGCGCT ACTTCGGCGA 90 0 

GCCCCAGGCT GACGGCGGTC CGGGGCCGAT GGAGGTGCTG CATATGGACG CCGTGTTCAC 96 0 

CTCCCTCGGC CTCCTCTACG CCTTCTGCGT CTCCGACTAC CTCCCCTGGC TGCGGGGCCT 102 0 

CGACCTCGAC GGCCACGAGA AGATCGTCAA GGAGGCTAAC GTGGCGGTGA ACAGGCTCCA 108 0 

CGACACGGTC ATCGACGACC GGTGGAGGCA GTGGAAGAGC GGCGAGCGGC AGGAGATGGA 114 0 

GGACTTCCTG GATGTGCTCA TCACTCTCAA GGACGCCCAG GGCAACCCGC TGCTGACCAT 1200 

CGAGGAGGTC AAAGCGCAGT CACAGGACAT CACGTTCGCG GCGGTGGACA ACCCGTCGAA 126 0 

CGCCGTGGAG TGGGCGCTGG CAGAGATGGT GAACAACCCG GAGGTGATGG CGAAGGCGAT 132 0 

GGAGGAGCTG GACCGCGTCG TCGGACGGGA GAGGCTAGTG CAGGAGTCGG ACATTCCGAA 138 0 

GCTCAACTAC GTGAAGGCCT GCATCCGGGA GGCTTTCCGT CTGCACCCGG TGGCGCCCTT 144 0 

CAACGTGCCC CACGTCGCGC TCGCCGACAC CACCATCGCC GGCTACCGCG TTCCCAAGGG 150 0 

CAGCCACGTG ATCCTGAGCC GCACGGGGCT GGGCCGCAAC CCGCGCGTGT GGGACGAGCC 156 0 

CCTGCGCTTC TACCCGGACC GACACCTCGC CACCGCCGCG TCCGACGTCG CGCTCACCGA 162 0 

GAACGACCTG CGGTTCATCT CCTTCAGCAC CGGCCGCCGC GGCTGCATCG CCGCGTCGCT 1680 

CGGCACCGCC ATGAGCGTCA TGCTCTTCGG AAGGCTCCTG CAGGGGTTCA CCTGGAGCAA 174 0 
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-continued 

GCCCGCCGGG GTGGAGGCCG TGGACCTCAG CGAGTCCAAG AGCGACACCT TCATGGCCAC 180 0 

CCCGCTGGTG CTGCACGCTG AGCCCAGGCT GCCGGCGCAC CTCTACCCGT CCATCTCCAT 1860 

CTGATTAAAC GTACGGCCGG TCGTCATTAT ATTGTATGCA TATAATTAAA GACGAGCGAG 1920 

CCTGCTGGTC ACACTTGCAT TGCATGTATC ATCAGCAGGG GGCTATGCAA TAAGTTTTTT 1980 

TTTTCCGCGC TTGATTTCGT GGTGCTGTGC GTATTCTGCG CACACCGACT GTACGTACGA 2040 

CGGCGTTCAG CTTTGTATTG TACCGAGTTA AAAAGTATTA TTATTATTAT CATCGACAAT 210 0 

AATAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 214 3 

(2) INFORMATION FOR SEQ ID HO: 2: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 5 58 amino acids 

( B ) TYPE : amino ac id 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sorghum bicolor 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: P-45 0-Tyr 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala Thr Met Glu Val Glu Ala Ala Ala Ala Thr Val Leu Ala Ala 
15 10 15 

Pro Leu Leu Ser. Ser Ser Ala lie Leu Lys Leu Leu Leu Phe Val Val 
20 25 30 

Thr Leu Ser Tyr Leu Ala Arg Ala Leu Arg Arg Pro Arg Lys Ser Thr 
35 40 45 

Thr Lys Cys Ser Ser Thr Thr Cys Ala Ser Pro Pro Ala Gly Val Gly 
50 55 60 

Asn Pro Pro Leu Pro Pro Gly Pro Val Pro Trp Pro Val Val Gly Asn 
65 70 75 80 

Leu Pro Glu Met Leu Leu Asn Lys Pro Ala Phe Arg Trp lie His Gin 
85 90 95 

Met Met Arg Glu Met Gly Thr Asp lie Ala Cys Val Lys Leu Gly Gly 
100 105 110 

Val His Val Vol. Ser lie Thr Cys Pro Glu lie Ala Arg Glu Val Leu 
115 120 125 

Arg Lys Gin Asp Ala Asn Phe lie Ser Arg Pro Leu Thr Phe Ala Ser 
130 135 140 

Glu Thr Phe Ser Gly Gly Tyr Arg Asn Ala Val Leu Ser Pro Tyr Gly 
145 150 155 160 

Asp Gin Trp Lys Lys Met Arg Arg Val Leu Thr Ser Glu lie lie Cys 
165 170 175 

Pro Ser Arg His Ala Trp Leu His Asp Lys Arg Thr Asp Glu Ala Asp 
180 185 190 

Asn Leu Thr Arg Tyr Val Tyr Asn Leu Ala Thr Lys Ala Ala Thr Gly 
195 200 205 

Asp Val Ala Val Asp Val Arg His Val Ala Arg His Tyr Cys Gly Asn 
210 215 220 

Val lie Arg Arg Leu Met Phe Asn Arg Arg Tyr Phe Gly Glu Pro Gin 
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Ala Asp Gly Gly Pro Gly Pro Met Glu Val Leu His Met Asp Ala Val 

245 250 255 

Phe Thr Ser Leu Gly Leu Leu Tyr Ala Phe Cys Val Ser Asp Tyr Leu 
260 265 270 

Pro Trp Leu Arg Gly Leu Asp Leu Asp Gly His Glu Lys He Val Lys 
275 280 285 

Glu Ala Asn Val Ala Val Asn Arg Leu His Asp Thr Val He Asp Asp 
290 295 300 

Arg Trp Arg Gin Trp Lys Ser Gly Glu Arg Gin Glu Met Glu Asp Phe 
305 310 315 320 

Leu Asp Val Leu He Thr Leu Lys Asp Ala Gin Gly Asn Pro Leu Leu 

325 330 335 



Thr He Glu Glu 
340 

Val Asp Asn Pro 
355 

Asn Asn Pro Glu 
370 

Val Gly Arg Glu 
385 

Tyr Val Lys Ala 



Pro Phe Asn Val 
420 

Tyr Arg Val Pro 
435 



Val Lys Ala Gin 



Ser Asn Ala Val 
360 

Val Met Ala Lys 
375 

Arg Leu Val Gin 
390 

Cys He Arg Glu 
405 

Pro His Val Ala 



Lys Gly Ser His 
440 



Ser Gin Asp He 
345 

Glu Trp Ala Leu 



Ala Met Glu Glu 

380 

Glu Ser Asp He 
395 

Ala Phe Arg Leu 
410 

Leu Ala Asp Thr 
425 

Val He Leu Ser 



Thr Phe Ala Ala 
350 

Ala Glu Met Val 
3 65 

Leu Asp Arg Val 



Pro Lys Leu Asn 
400 

His Pro Val Ala 
415 

Thr He Ala Gly 
430 

Arg Thr Gly Leu 
445 



Gly Arg Asn Pro Arg Val Trp Asp 
450 455 

Arg His Leu Ala Thr Ala Ala Ser 
465 470 

Leu Arg Phe He Ser Phe Ser Thr 
485 



Glu Pro Leu Arg Phe Tyr Pro Asp 
460 

Asp Val Ala Leu Thr Glu Asn Asp 
475 480 

Gly Arg Arg Gly Cys He Ala Ala 

490 495 



Ser Leu Gly Thr Ala Met Ser Val Met Leu Phe Gly Arg Leu Leu Gin 
500 505 510 

Gly Phe Thr Trp Ser Lys Pro Ala Gly Val Glu Ala Val Asp Leu Ser 
515 520 525 

Glu Ser Lys Ser Asp Thr Phe Met Ala Thr Pro Leu Val Leu His Ala 
530 535 540 

Glu Pro Arg Leu Pro Ala His Leu Tyr Pro Ser He Ser He 
545 550 555 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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Met Ala Thr Met Glu Val Glu Ala Ala Ala Ala Thr Val Leu Ala Ala 
15 10 15 

Pro 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 7 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4; 

Val Trp Asp Glu Pro Leu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE; protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Tyr Val Tyr Asn Leu Ala Thr Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

Ser Asp Thr Phe Met Ala Thr Pro Leu Val Ser Ser Ala Glu Pro Arg 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

<iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Gin Ser Gin Asp lie Thr Phe Ala Ala Val Asp Asn Pro Ser Asn 
15 10 15 

Ala Val Glu Xaa Ala Leu Ala Glu Met Val Asn Asn Pro Glu Val Met 
20 25 30 

Ala Lys 

(2) INFORMATION FOR SEQ ID NO: 8; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ala Gin Gly Asn Pro Leu Leu Thr lie Glu Glu Val Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Leu Val Gin Glu Ser Asp lie Pro Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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lie Ser Phe Ser Thr Gly 
1 5 
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(2) INFORMATION FOR SEQ ID NO; 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

<iii) ANTI- SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Leu Pro Ala His Leu Tyr Pro Ser lie Ser Leu His 
15 10 



(2) INFORMATION FOR SEQ ID NO; 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12: 

Met Asp Leu Ala Asp He Pro Lys Gin Gin Arg Leu Met Ala Gly Asn 
15 10 15 

Ala Leu Val Val 
20 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13: 

Ala Arg Leu Ala Glu He Phe Ala Thr He He 
15 10 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 
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(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Asp Phe Thr Val Thr Thr Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii> MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 15: 

Gin Tyr Ala Ala Leu Gly Ser Val Phe Thr Val Pro lie lie 
15 10 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Xaa Xaa Pro Phe Pro lie 
1 5 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligonucleotide specifying AA sequence MEVEAA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17: 

ATGGARGTNG ARGCNGC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligonucleo-tide specifying AA sequence DFTMAT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 18: 

GAYACNTTYA TGGCNAC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: TYROLlb 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 19: 

CGGGATCCAT ATGCTGCTGT TATTAGCAGT TTTTCTGTCG TA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: TYROL2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GACCGGCCGA AGCTTTAATT AGATGGAGAT GGA 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: Tyro 13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 21: 

AGTGGATCCA GCGGAATGCC GGCTT 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 17 base pairs 

(B ) TYPE : nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

<iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B> CLONE: TYROL4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CGTCATGCTC TTCGGAA 

(2) INFORMATION FOR SEQ ID NO; 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANT I- SENSE: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: TYROL Id 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CGGGATCCAT ATGGCTCTGT TATTAGCAGT TTTTCTGTCG TACCTGGCCC G 51 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide with an EcoRI site" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GCAGGAATTC CGG 



What is claimed is: 

1. An isolated cytochrome P-450 monooxygenase that 
catalyzes the conversion of an amino acid to the correspond- 
ing N-hydroxyamino acid and the conversion of said 5Q 
N-hydroxyamino acid to the corresponding oxime. 

2. An isolated cytochrome P-450 monooxygenase accord- 
ing to claim 1, wherein said monooxygenase is obtained 
from a plant that produces cyanogenic glycosides or glu- 
cosinolates. 55 

3. An isolated cytochrome P-450 monooxygenase accord- 
ing to claim 1, wherein said monooxygenase is obtained 
from a plant selected from the group consisting of the genera 
Sorghum, Trifolium, Linum, Taxus, Triglochin, Mannihot, 
Amygdalus, Prunus, and cruciferous plants. 

4. An isolated cytochrome P-450 monooxygenase accord- 
ing to claim 3, wherein said monooxygenase is obtained 
from Sorghum hicolor, 

5. An isolated cytochrome P-450 monooxygenase accord- 
ing to claim 1, wherein said amino acid is selected from the 
group consisting of tyrosine, phenylalanine, tryptophan, 65 
valine, leucine, iso leucine, and cyclopentenyl glycine isoleu- 
cine. 



6. An isolated cytochrome P-450 monooxygenase accord- 
ing to claim 5, wherein said amino acid is tyrosine. 

7. An isolated cytochrome P-450 monooxygenase accord- 
ing to claim 1, wherein said monooxygenase has a molecular 
weight of 57 kD, as determined by SDS-PAGE. 

8. An isolated cytochrome P-450 monooxygenase accord- 
ing to claim 1, wherein said monooxygenase comprises an 
N-terminal amino acid sequence as shown in SEQ ID NO: 
3. 

9. An isolated cytochrome P-450 monooxygenase accord- 
ing to claim 1, wherein said monooxygenase comprises an 
amino acid sequence as shown in SEQ ID NO: 2. 

10. An isolated cytochrome P-450 monooxygenase 
according to claim 1, wherein said monooxygenase com- 
prises an intemal amino acid sequence selected from the 
group consisting of SEQ ID NOs: 4-11. 

11. An isolated cytochrome P-450 monooxygenase 
according to claim 10, wherein said monooxygenase com- 
prises an intemal amino acid sequence as shown in SEQ ID 
NO: 4. 
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12. An isolated cytochrome P-450 monooxygenase 
according to claim 10, wherein said monooxygenase com- 
prises an internal amino acid sequence as shown in SEQ ID 
NO: 5. 

13. An isolated cytochrome P-450 monooxygenase 
according to claim 10, wherein said monooxygenase com- 
prises an internal amino acid sequence as shown in SEQ ID 
NO: 6. 

14. An isolated cytochrome P-450 monooxygenase 
according to claim 10, wherein said monooxygenase com- 
prises an internal amino acid sequence as shown in SEQ ID 
NO: 7. 

15. An isolated cytochrome P-450 monooxygenase 
according to claim 10, wherein said monooxygenase com- 
prises an internal amino acid sequence as shown in SEQ ID 
NO: 8. 

16. An isolated cytochrome P-450 monooxygenase 
according to claim 10, wherein said monooxygenase com- 
prises an internal amino acid sequence as shown in SEQ ID 
NO: 9. 

17. An isolated cytochrome P-450 monooxygenase 
according to claim 10, wherein said monooxygenase com- 
prises an internal amino acid sequence as shown in SEQ ID 
NO: 10. 

18. An isolated cytochrome P-450 monooxygenase 
according to claim 10, wherein said monooxygenase com- 
prises an internal amino acid sequence as shown in SEQ ID 
NO: 11. 

19. A method of producing an isolated cytochrome P-450 
monooxygenase according to claim 1, comprising: 

a) engineering a gene encoding said monooxygenase to be 
expressible in a host organism; 

b) transforming said host organism with the engineered 
gene; and 

c) isolating the monooxygenase from the host organism or 
a culture supernatant, 

20. A method according to claim 19, wherein the host 
organism is selected from the group consisting of bacteria, 
yeast, and insect cells. 

21. An isolated cytochrome P-450 monooxygenase that 
catalyzes the conversion of an oxime to the corresponding 
nitrile and the conversion of said nitrilc to the corresponding 
cyanohydrine. 

22. An isolated cytochrome P-450 monooxygenase 
according to claim 21, wherein said monooxygenase is 
obtained from a plant that produces cyanogenic glycosides 
or glucosinolates. 

23. An isolated cytochrome P-450 monooxygenase 
according to claim 21, wherein said monooxygenase is 
obtained from a plant selected from the group consisting of 
the genera Sorghum, Trifolium, Linum, Taxus, Triglochin, 
Mannihot, Amygdalus, Prunus, and cruciferous plants. 

24. An isolated cytochrome P-450 monooxygenase 
according to claim 23, wherein said monooxygenase is 
obtained from Sorghum bicolor. 
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25. An isolated cytochrome P-450 monooxygenase 
according to claim 21, wherein said oxime is obtained by the 
conversion of an amino acid to the corresponding 
N-hydroxyamino acid and the conversion of said 

5 N-hydroxyamino acid to said oxime by another cytochrome 
P-450 monooxygenase. 

26. An isolated cytochrome P-450 monooxygenase 
according to claim 25, wherein said amino acid is selected 
from the group consisting of tyrosine, phenylalanine, 

10 tryptophan, valine, leucine, isoleucine, and cyclopentenylg- 
lycine isoleucine. 

27. An isolated cytochrome P-450 monooxygenase 
according to claim 26, wherein said amino acid is tyrosine. 

28. An isolated cytochrome P-450 monooxygenase 
15 according to claim 21, wherein the ability of said monooxy- 
genase to convert an oxime to the corresponding nitrile 
depends on the presence of NADPH and wherein this 
dependency can be overcome by the addition of reductants. 

29. An isolated cytochrome P-450 monooxygenase 
20 according to claim 21, wherein said monooxygenase has a 

molecular weight of 51 kD, as determined by SDS-PAGE. 

30. An isolated cytochrome P-450 monooxygenase 
according to claim 21, wherein said monooxygenase com- 
prises an N-terminal amino acid sequence as shown in SEQ 

25 ID NO: 12. 

31. An isolated cytochrome P-450 monooxygenase 
according to claim 21, wherein said monooxygenase com- 
prises an amino acid sequence selected from the group 
consisting of SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 

30 15, and SEQ ID NO: 16. 

32. An isolated cytochrome P-450 monooxygenase 
according to claim 31, wherein said monooxygenase com- 
prises an amino acid sequence as shown in SEQ ID NO: 13. 

33. An isolated cytochrome P-450 monooxygenase 
35 according to claim 31, wherein said monooxygenase com- 
prises an amino acid sequence as shown in SEQ ID NO: 14. 

34. An isolated cytochrome P-450 monooxygenase 
according to claim 31, wherein said monooxygenase com- 
prises an amino acid sequence as shown in SEQ ID NO: 15. 

40 35. An isolated cytochrome P-450 monooxygenase 
according to claim 31, wherein said monooxygenase com- 
prises an amino acid sequence as shown in SEQ ID NO: 16. 

36. A method of producing an isolated cytochrome P-450 
monooxygenase according to claim 21, comprising: 

45 a) engineering a gene encoding said monooxygenase to be 
expressible in a host organism; 

b) transforming said host organism with the engineered 
gene; and 

c) isolating the monooxygenase from the host organism or 
a culture supernatant. 

37. A method according to claim 36, wherein the host 
organism is selected from the group consisting of bacteria, 
yeast, and insect cells, 

9|C «tt « 4c 4t 
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[57] ABSTRACT 

The invention provides a human cytochrome P450 
(HUCYP) and polynucleotides which identify and encode 
HUCYP. The invention also provides expression vectors, 
host ceUs, antibodies, agonists, and antagonists. The inven- 
tion also provides methods for diagnosing, treating, or 
preventing disorders associated with expression of HUCYP. 

11 Claims, 5 Drawing Sheets 
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1 

HUMAN CYTOCHROME P450 

HELD OF THE INVENTION 

This invention relates to nucleic acid and amino acid 
sequences of a human cytochrome P450 and to the use of 
these sequences in the diagnosis, treatment, and prevention 
of cell proliferative, developmental, autoimmune/ 
inflammatory, and metabolic disorders. 

BACKGROUND OF THE INVENTION 

Members of the cytochrome P450 superfamily of 
enzymes catalyze the oxidative metabolism of a variety of 
substrates, including natural compounds such as steroids, 
fatty acids, prostaglandins, leukotrienes, and vitamins, as 
well as drugs, carcinogens, mutagens, and xenobiotics. 
Cytochrome P450s, also known as P450 heme-thiolate 
proteins, usually act as terminal oxidases in multi- 
component electron transfer chains, called P450-containing 
monooxygenase systems. Specific reactions catalyzed 
include hydroxylation, epoxidation, N-oxidation, 
sulfooxidation, N-, S-, and O-dealkylations, desulfation, 
deamination, and reduction of azo, nitro, and N-oxide 
groups. These reactions are involved in steroidogenesis of 
glucocorticoids, Cortisols, estrogens, and androgens in ani- 
mals; insecticide resistance in insects; herbicide resistance 
and flower coloring in plants; and environmental bioreme- 
diation by microorganisms. Cytochrome P450 actions on 
drugs, carcinogens, mutagens, and xenobiotics can result in 
detoxification or in conversion of the substance to a more 
toxic product. Cytochrome P450s are abundant in the liver, 
but also occur in other tissues. (See ExPASY ENZYME EC 
1.14.14.1; Prosite PDOC00081 Cytochrome P450 cysteine 
heme-iron ligand signature; PRINTS EP450I E-Class P450 
Group I signature; and Graham-Lorence, S. and Peterson, J. 
A. (1996) FASEB J. 10:206-214.) 

Four hundred cytochrome P450s have been identified in 
diverse organisms including bacteria, fungi, plants, and 
animals (Graham-Lorence, supra). The B-class is found in 
prokaryotes and fungi, while the E-class is found is found in 
bacteria, plants, insects, vertebrates, and mammals. Five 
subclasses or groups are found within the larger family of 
E-class cytochrome P450s (PRINTS EP450I E-Class P450 
Group I signature). 

All cytochrome P450s use a heme co factor and share 
structural attributes. Most cytochrome P450s are 400 to 530 
amino acids in length. The secondary structure of the 
enzyme is about 70% alpha-helical and about 22% beta- 
sheet. The region around the heme-binding site in the 
C-terminal part of the protein is conserved among cyto- 
chrome P450s, A ten amino acid signature sequence in this 
hemeiron ligand region has been identified which includes a 
conserved cysteine involved in binding the heme iron in the 
fifth coordination site. In eukaryotic cytochrome P450s, a 
membrane -spanning region is usually found in the first 
15-20 amino acids of the protein, generally consisting of 
approximately 15 hydrophobic residues followed by a posi- 
tively charged residue. (See Prosite PDOC0081, supra; 
Graham-Lorence, supra.) 

Cytochrome P450 enzymes are involved in ceU prolifera- 
tion and development. The enzymes have roles in chemical 
mutagenesis and carcinogenesis by metabolizing chemicals 
to reactive intermediates that form adducts with DNA 
(Nebert, D. W. and Gonzalez, R J. (1987) Ann. Rev. Bio- 
chem, 56:945-993), These adducts can cause nucleotide 
changes and DNA rearrangements that lead to oncogenesis. 
Cytochrome P450 expression in liver and other tissues is 
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induced by xenobiotics such as polycyclic aromatic 
hydrocarbons, peroxisomal proliferators, phenobarbital, and 
the glucocorticoid dexamethasone (Dogra, S. C. et al. (1998) 
CHn. Exp. Pharmacol. Physiol. 25:1-9). A cytochrome P450 

5 protein may participate in eye development as mutations in 
the P450 gene CYPIBI cause primary congenital glaucoma 
(Online Mendelian Inheritance in Man (OMIM) * 601 771 
Cytochrome P450, subfamily I (dioxin-inducible), polypep- 
tide 1; CYPIBI). Cytochrome P450s arc also involved in 

10 drug interactions, because the induction of a cytochrome 
P450 by one drug may affect the metabolism of another drug 
by the enzyme (Katzung, B. G. (1995) Basic and Clinical 
Pharmacoloyj Appleton & Lange, Norwalk Conn., pp. 
48-59). 

^5 Cytochrome P450s are associated with inflammation and 
infection. Hepatic cytochrome P450 activities are pro- 
foundly affected by various infections and inflammatory 
stimuli, some of which are suppressed and some induced 
(Morgan, E. T. (1997) Drug Metab. Rev. 29:1129-1188). 

20 Effects observed in vivo can be mimicked by proinflamma- 
tory cytokines and interferons. Autoantibodies to two cyto- 
chrome P450 proteins were found in patients with autoim- 
mune polyenodocrinopathy-candidiasis-ectodermal 
dystrophy (APECED), a polyglandular autoimmune syn- 

25 drome (OMIM * 240300 Autoimmune polyendocrinopathy- 
candidiasis-ectodermal dystrophy). 

Mutations in cytochrome P450s have been linked to 
metabolic disorders, including congenital adrenal 
hyperplasia, the most common adrenal disorder of infancy 
and childhood; pseudovitamin D -deficiency rickets; cere- 
brotendinous xanthomatosis, a lipid storage disease charac- 
terized by progressive neurologic dysfunction, premature 
atherosclerosis, and cataracts; and an inherited resistance to 
the anticoagulant drugs coumarin and warfarin (Isselbacher, 
K. J. et al. (1994) Harrison's Principles of Internal 
Medicine^ McGraw-Hill, Inc. New York, N.Y., pp. 
1968-1970; Takeyama, K. et al. (1997) Science 
277:1827-1830; Kitanaka, S. et al. (1998) N. Engl. J, Med. 
338:653-661; OMIM *213700 Cerebro tendinous xanth- 
omatosis; and OMIM #122700 Coumarin resistance). 
Extremely high levels of expression of the cytochrome P450 
protein aromatase were found in a fibrolamellar hepatocel- 
lular carcinoma from a boy with severe gynecomastia 
(feminization) (Agarwal, V. R. (1998) J. Clin. Endocrinol. 

'^^ Metab. 83:1797-1800). 

The discovery of a new human cytochrome P450 and the 
polynucleotides encoding it satisfies a need in the art by 
providing new compositions which are useful in the 
diagnosis, prevention, and treatment of ceU proliferative, 
developmental, autoimmune/inflammatory, and metabolic 
disorders. 

SUMMARY OF THE INVENTION 

55 The invention is based on the discovery of a new human 
cytochrome P450 (HUCYP), the polynucleotides encoding 
HUCYP, and the use of these compositions for the diagnosis, 
treatment, or prevention of ceU proliferative, developmental, 
autoimmune/inflammatory, and metabolic disorders. 

60 The invention features a substantially purified polypep- 
tide comprising the amino acid sequence of SEQ ID NO:l 
or a fragment of SEQ ID NO:l. 

The invention further provides a substantially purified 
variant having at least 90% amino acid sequence identity to 

65 the amino acid sequence of SEQ ID NO:l or a fragment of 
SEQ ID NO:l. The invention also provides an isolated and 
purified polynucleotide encoding the polypeptide compris- 
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ing the amino acid sequence of SEQ ID NO:l or a fragment 
of SEQ ID NO:l. The invention also includes an isolated and 
purified polynucleotide variant having at least 70% poly- 
nucleotide sequence identity to the polynucleotide encoding 
the polypeptide comprising the amino acid sequence of SEQ 
ID NO:l or a fragment of vSEQ ID NO:l. 

ITie invention further provides an isolated and purified 
polynucleotide which hybridizes tuider stringent conditions 
to the polynucleotide encoding the polypeptide comprising 
the amino acid sequence of SEQ ID NO:l or a fragment of 
SEQ ID NO:l, as well as an isolated and purified polynucle- 
otide having a sequence which is complementary to the 
polynucleotide encoding the polypeptide comprising the 
amino acid sequence of SEQ ID NO:l or a fragment of SEQ 
ID NO:l. 

The invention also provides a method for detecting a 
polynucleotide in a sample containing nucleic acids, the 
method comprising the steps of (a) hybridizing the comple- 
ment of the polynucleotide sequence to at least one of the 
polynucleotides of the sample, thereby forming a hybrid- 
ization complex; and (b) detecting the hybridization 
complex, wherein the presence of the hybridization complex 
correlates with the presence of a polynucleotide in the 
sample. In one aspect, the method further comprises ampli- 
fying the polynucleotide prior to hybridization. 

The invention also provides an isolated and purified 
polynucleotide comprising the polynucleotide sequence of 
SEQ ID NO:2 or a firagment of SEQ ID NO:2, and an 
isolated and purified polynucleotide variant having at least 
70% polynucleotide sequence identity to the polynucleotide 
comprising the polynucleotide sequence of SEQ ID NO:2 or 
a fragment of SEQ ID NO:2. The invention also provides an 
isolated and purified polynucleotide having a sequence 
complementary to the polynucleotide comprising the poly- 
nucleotide sequence of SEQ ID NO:2 or a fragment of SEQ 
ID NO:2. 

The invention further provides an expression vector con- 
taining at least a fragment of the polynucleotide encoding 
the polypeptide comprising the sequence of SEQ ID NO: 1 or ^ 
a fragment of SEQ ID NO:l. In another aspect, the expres- 
sion vector is contained within a host cell. 

The invention also provides a method for producing a 
polypeptide, the method comprising the steps of: (a) cultur- 
ing the host cell containing an expression vector containing 
at least a fragment of a polynucleotide under conditions 
suitable for the expression of the polypeptide; and (b) 
recovering the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composi- 
tion comprising a substantially purified polypeptide having 
the sequence of SEQ ID NO:l or a fragment of SEQ ID 
NO:l in conjunction with a suitable pharmaceutical carrier. 

The invention fiirther includes a purified antibody which 
binds to a polypeptide comprising the sequence of SEQ ID 
NO:l or a fragment of SEQ ID NO:l, as well as a purified 
agonist and a purified antagonist of the polypeptide. 

The invention also provides a method for treating or 
preventing a disorder associated with decreased expression 
or activity of HUCYP, the method comprising administering 
to a subject in need of such treatment an effective amount of 60 
a pharmaceutical composition comprising a substantially 
purified polypeptide having the amino acid sequence of SEQ 
ID NO: 1 or a fragment of SEQ ID NO:l, in conjunction with 
a suitable pharmaceutical carrier. 

The invention also provides a method for treating or 65 
preventing a disorder associated with increased expression 
or activity of HUCYP, the method comprising administering 
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to a subject in need of such treatment an effective amount of 
an antagonist of the polypeptide having the amino acid 
sequence of SEQ ID NO: 1 or a fragment of SEQ ID NO:l. 

BRIEF DESCRIPTION OF THE FIGURES AND 
TABLE 

FIGS. lA, IB, IC, ID, and IE show the amino acid 
sequence (SEQ ID NO:l) and nucleic acid sequence (SEQ 
ID NO:2) of HUCYP. The alignment was produced using 
MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco Calif.). 

Table 1 shows the tools, programs, and algorithms used to 
analyze HUCYP, along with applicable descriptions, 
references, and threshold parameters. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and 
methods are described, it is understood that this invention is 
not limited to the particular machines, materials and meth- 
ods described, as these may vary. It is also to be understood 
that the terminology used herein is for the purpose of 
describing particular embodiments only, and is not intended 
to limit the scope of the present invention which will be 
limited only by the appended claims. 

It must be noted that as used herein and in the appended 
claims, the singular forms "a," "an," and "the" include plural 
reference unless the context clearly dictates otherwise. Thus, 
for example, a reference to "a host cell" includes a plurality 
of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof 
known to those skilled in the 14 art, and so forth. 

Unless defined otherwise, all technical and scientific 
terms used herein have the same meanings as commonly 
understood by one of ordinary skill in the art to which this 
invention belongs. Although any machines, materials, and 
methods similar or equivalent to those described herein can 
be used to practice or test the present invention, the preferred 
machines, materials and methods are now described. All 
publications mentioned herein are cited for the purpose of 
describing and disclosing the ceil fines, protocols, reagents 
and vectors which are reported in the publications and which 
might be tised in connection with the invention. Nothing 
herein is to be construed as an admission that the invention 
is not entitled to antedate such disclosure by virtue of prior 
invention. 

Definitions 

"HUCYP" refers to the amino acid sequences of substan- 
tially purified HUCYP obtained from any species, particu- 
larly a mammalian species, including bovine, ovine, porcine, 
murine, equine, and preferably the human species, from any 
source, whether natural, synthetic, semi-synthetic, or recom- 
binant. 

The term "agonist" refers to a molecule which, when 
bound to HUCYP, increases or prolongs the duration of the 
effect of HUCYP. Agonists may include proteins, nucleic 
acids, carbohydrates, or any other molecules which bind to 
and modulate the effect of HUCYP. 

An "allelic variant" is an alternative form of the gene 
encoding HUCYP. Allelic variants may result from at least 
one mutation in the nucleic acid sequence and may result in 
altered mRNAs or in polypeptides whose structure or func- 
tion may or may not be altered. Any given natural or 
recombinant gene may have none, one, or many allelic 
forms. Common mutational changes which give rise to 
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allelic variants are generally ascribed to natural deletions, 
additions, or substitutions of nucleotides. Each of these 
types of changes may occur alone, or in combination with 
the others, one or more times in a given sequence. 

"Altered" nucleic acid sequences encoding HUCYP 
include those sequences with deletions, insertions, or sub- 
stitutions of different nucleotides, resulting in a polypeptide 
the same as HUCYP or a polypeptide with at least one 
functional characteristic of HUCYP. Included within this 



hemocyanin (KLH). The coupled peptide is then used to 
immunize the animal. 

The term "antigenic determinant" refers to that fragment 
of a molecule (i.e., an epitope) that makes contact with a 
particular antibody. When a protein or a fragment of a 
protein is used to immunize a host animal, numeroxis regions 
of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (given 
regions or three-dimensional structures on the protein). An 



definition are polymorphisms which may or may not be ^0 antigenic determinant may compete with the intact antigen 



readily detectable xising a particular oligonucleotide probe of 
the polynucleotide encoding HUCYP, and improper or unex- 
pected hybridization to allelic variants, with a locus other 
than the normal chromosomal locus for the polynucleotide 
sequence encoding HUCYP. The encoded protein may also 
be "altered," and may contain deletions, insertions, or sub- 
stitutions of amino acid residues which produce a silent 
change and result in a functionally equivalent HUCYP. 
Deliberate amino acid substitutions may be made on the 
basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues, as long as the biological or immu- 
nological activity of HUCYP is retained. For example, 
negatively charged amino acids may include aspartic acid 
and glutamic acid, positively charged amino acids may 
include lysine and arginine, and amino acids with uncharged 
polar head groups having similar hydrophilicity values may 
include leucine, isoleucine, and valine; glycine and alanine; 
asparagine and glutamine; serine and threonine; and pheny- 
lalanine and tyrosine. 

The terms "amino acid" or "amino acid sequence" refer to 
an oligopeptide, peptide, polypeptide, or protein sequence, 
or a fragment of any of these, and to naturally occurring or 
synthetic molecules. In this context, "fragments," "immu- 
nogenic fragments," or "antigenic fragments" refer to frag- 
ments of HUCYP which are preferably at least 5 to about 15 
amino acids in length, most preferably at least 14 amino 
acids, and which retain some biological activity or immu- 
nological activity of HUCYP Where "amino acid sequence" 
is recited to refer to an amino acid sequence of a naturally 
occurring protein molecule, "amino acid sequence" and like 
terms are not meant to limit the amino acid sequence to the 
complete native amino acid sequence associated with the 
recited protein molecule. 

"Amplification" relates to the production of additional 
copies of a nucleic acid sequence. Amplification is generally 
carried out using polymerase chain reaction (PCR) technolo- 
gies well known in the art. 
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(i.e., the immunogen used to elicit the immune response) for 
binding to an antibody. 

The term "antisense" refers to any composition containing 
a nucleic acid sequence which is complementary to the 
"sense" strand of a specific nucleic acid sequence. Antisense 
molecules may be produced by any method including syn- 
thesis or transcription. Once introduced into a cell, the 
complementary nucleotides combine with natural sequences 
produced by the cell to form duplexes and to block either 
transcription or translation. The designation "negative" can 
refer to the antisense strand, and the designation "positive" 
can refer to the sense strand. 

The term "biologically active," refers to a protein having 
structural, regulatory, or biochemical functions of a naturally 
occurring molecule. Likewise, "immunologically active" 
refers to the capability of the natural, recombinant, or 
synthetic HUCYP, or of any oligopeptide thereof, to induce 
a specific immune response in appropriate animals or cells 
and to bind with specific antibodies. 

The terms "complementary" or "complementarity" refer 
to the natural binding of polynucleotides by base pairing. 
For example, the sequence "5' A-G-T 3'" bonds to the 
complementary sequence "3' T-C-A 5'," Complementarity 
between two single-stranded molecules may be "partial," 
such that only some of the nucleic acids bind, or it may be 
"complete," such that total complementarity exists between 
the single stranded molecules. The degree of complemen- 
tarity between nucleic acid strands has significant effects on 
the efficiency and strength of the hybridization between the 
nucleic acid strands. This is of particular importance in 
amplification reactions, which depend upon binding 
between nucleic acids strands, and in the design and use of 
peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide 
sequence" or a "composition comprising a given amino acid 
sequence" refer broadly to any composition containing the 
given polynucleotide or amino acid sequence. The compo- 
sition may comprise a dry formulation or an aqueous solu- 



The term "antagonist" refers to a molecule which, when 50 tion. Compositions comprising polynucleotide sequences 



encoding HUCYP or fragments of HUCYP may be 
employed as hybridization probes. The probes may be stored 
in fi-eeze-dried form and may be associated with a stabilizing 
agent such as a carbohydrate. In hybridizations, the probe 
55 may be deployed in an aqueous solution containing salts 
(e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), 
and other components (e.g., Denhardt's solution, dry milk, 
salmon sperm DNA, etc.). "Consensus sequence" refers to a 
nucleic acid sequence which has been resequenced to 



bound to HUCYP, decreases the amount or the duration of 
the effect of the biological or immunological activity of 
HUCYP. Antagonists may include proteins, nucleic acids, 
carbohydrates, antibodies, or any other molecules which 
decrease the effect of HUCYP. 

The term "antibody" refers to intact molecules as well as 
to fi-agments thereof, such as Fab, F(ab')2, and Fv fragments, 
which are capable of binding the epitopic determinant. 

Antibodies that bind HUCYP polypeptides can be prepared . nr-o 1 

using intact polypeptides or using fragments containing 60 resolve uncalled bases, extended usmg XL- PCK kit (Perkm- 
small peptides of interest as the immunizing antigen. The Elmer, Norwalk Conn.) in the 5' and/or the 3' direction, and 
polypeptide or oligopeptide used to immunize an animal resequenced, or which has been assembled from the over- 
(e g a mouse, a rat, or a rabbit) can be derived from the lapping sequences of more than one Incyte Clone using a 
translation of RNA, or synthesized chemically, and can be computer program for fragment assembly, such as the GEL- 
conjugated to a carrier protein if desired. Commonly used 65 VIEW Fragment Assembly system (GCG, Madison W^.). 
carriers that are chemically coupled to peptides include Some sequences have been both extended and assembled to 
bovine serum albumin, thyroglobulin, and keyhole limpet produce the consensus sequence. 
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The term "correlates with expression of a polynucleotide" 
indicates that the detection of the presence of nucleic acids, 
the same or related to a nucleic acid sequence encoding 
HUCYP by northern analysis is indicative of the presence of 
nucleic acids encoding HUCYP in a sample, and thereby 
correlates with expression of the transcript from the poly- 
nucleotide encoding HUCYP. 

A "deletion" refers to a change in the amino acid or 
nucleotide sequence that results in the absence of one or 
more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification 
of a polypeptide sequence, or a polynucleotide sequence. 
Chemical modifications of a polynucleotide sequence can 
include, for example, replacement of hydrogen by an alkyl, 
acyl, or amino group. A derivative polynucleotide encodes a 
polypeptide which retains at least one biological or imniu- 
nological function of the natural molecule. A derivative 
polypeptide is one modified by glycosylation, pegylation, or 
any similar process that retains at least one biological or 
immunological function of the polypeptide from which it 
was derived. 

llie term "similarity" refers to a degree of complemen- 
tarity. There may be partial similarity or complete similarity. 
The word "identity" may substitute for the word "similar- 
ity." A partially complementary sequence that at least par- 25 
tially inhibits an identical sequence from hybridizing to a 
target nucleic acid is referred to as "substantially simUar" 
The inhibition of hybridization of the completely comple- 
mentary sequence to the target sequence may be examined 
using a hybridization assay (Southern or northern blot, 30 
solution hybridization, and the like) under conditions of 
reduced stringency. A substantially similar sequence or 
hybridization probe wiU compete for and inhibit the binding 
of a completely similar (identical) sequence to the target 
sequence under conditions of reduced stringency. This is not 35 
to say that conditions of reduced stringency are such that 
non-specific binding is permitted, as reduced strmgency 
conditions require that the binding of two sequences to one 
another be a specific (i.e., a selective) interaction. The 
absence of non-specific binding may be tested by the use of 40 
a second target sequence which lacks even a partial degree 
of complementarity (e.g., less than about 30% similanty or 
identity). In the absence of non-specific binding, the sub- 
stantiaUy similar sequence or probe will not hybridize to the 
second non-complementary target sequence. 

The phrases "percent identity" or "% identity" refer to the 
percentage of sequence similarity found in a comparison of 
two or more amino acid or nucleic acid sequences. Percent 
identity can be determined electronically, e.g., by using the 
MEGALIGN program (DNASTAR) which creates align- 
ments between two or more sequences according to methods 
selected by the user, e.g., the clustal method. (See, e.g., 
Higgins, D. G. and P M. Sharp (1988) Gene 73:237-244.) 
The clustal algorithm groups sequences into clusters by 
examining the distances between all pairs. The clusters are 
aligned pairwise and then in groups. The percentage suni- 
larity between two amino acid sequences, e.g., sequence A 
and sequence B, is calculated by dividing the length of 
sequence A, minus the number of gap residues in sequence 
A, minus the number of gap residues in sequence B, into the 
sum of the residue matches between sequence A and 
sequence B, times one hundred. Gaps of low or of no 
similarity between the two amino acid sequences are not 
included in determining percentage similarity. Percent iden- 



183:626-645.) Identity between sequences can also be 
determined by other methods known in the art, e.g., by 
varying hybridization conditions. 

"Human artificial chromosomes" (HACs) are linear 
microchromosomes which may contain DNA sequences of 
about 6 kb to 10 Mb in size, and which contain all of the 
elements required for stable mitotic chromosome segrega- 
tion and maintenance. 

The term "humanized antibody" refers to antibody mol- 
ecules in which the amino acid sequence in the non-antigen 
binding regions has been altered so that the antibody more 
closely resembles a human antibody, and stiU retains its 
original binding ability. 

"Hybridization" refers to any process by which a strand of 
nucleic acid binds with a complementary strand through 
base pairing. 

The term "hybridization complex" refers to a complex 
formed between two nucleic acid sequences by virtue of the 
formation of hydrogen bonds between complementary 
bases. A hybridization complex may be formed in solution 
(e.g.. Cot or Rot analysis) or formed between one nucleic 
acid sequence present in solution and another nucleic acid 
sequence immobilized on a soUd support (e.g., paper, 
membranes, filters, chips, pins or glass slides, or any other 
appropriate substrate to which cells or their nucleic acids 
have been fi^xed). 

'^The words "insertion" or "addition" refer to changes in an 
amino acid or nucleotide sequence resulting in the addition 
of one or more amino acid residues or nucleotides, 
respectively, to the sequence found in the naturally occurring 
molecule. 

"Immune response" can refer to conditions associated 
with inflammation, trauma, immune disorders, or infectious 
or genetic disease, etc. These conditions can be character- 
ized by expression of various factors, e.g., cytokines, 
chemokines, and other signaling molecules, which may 
affect cellular and systemic defense systems. 

The term "microarray" refers to an arrangement of dis- 
tinct polynucleotides on a substrate. 

The terms "element" or "array element" in a microarray 
context, refer to hybridizable polynucleotides arranged on 
the surface of a substrate. 

The term "modulate" refers to a change in the activity of 
HUCYP. For example, modulation may cause an increase or 
a decrease in protein activity, binding characteristics, or any 
other biological, functional, or immunological properties of 
HUCYP. 

50 The phrases "nucleic acid" or "nucleic acid sequence" 
refer to a nucleotide, oligonucleotide, polynucleotide, or any 
fragment thereof. These phrases also refer to DNA or RNA 
of genomic or synthetic origin which may be single-stranded 
or double-stranded and may represent the sense or the 
55 antisense strand, to peptide nucleic acid (PNA), or to any 
DNA-like or RNA-like material. In this context, "frag- 
ments" refers to those nucleic acid sequences which, when 
translated, would produce polypeptides retaining some func- 
tional characteristic, e.g., antigenicity, or structural domain 
characteristic, e.g., ATP-binding site, of the full-length 
polypeptide. 

The terms "operably associated" or "operably linked" 
refer to functionaUy related nucleic acid sequences. A pro- 
moter is operably associated or operably linked with a 
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included in determining percentage s.mUanty. l-ercem icen- ™ ^ "^^^^Vf thTpromoter controls the translation of 
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the same reading frame, certain genetic elements, e.g., 
repressor genes, are not contiguously linked to the sequence 
encoding the polypeptide but still bind to operator sequences 
that control expression of the polypeptide. 

The term "oligonucleotide" refers to a nucleic acid 
sequence of at least about 6 nucleotides to 60 nucleotides, 
preferably about 15 to 30 nucleotides, and most preferably 
about 20 to 25 nucleotides, which can be used in PGR 
amplification or in a hybridization assay or microarray. 



according to various methods well known in the art, and may 
rely on any known method for the insertion of foreign 
nucleic acid sequences into a prokaryotic or eukaryotic host 
cell, llie method for transformation is selected based on the 
type of host cell being transformed and may include, but is 
not limited to, viral infection, electroporation, heat shock, 
lipofection, and particle bombardment. The term "trans- 
formed" cells includes stably transformed cells in which the 
inserted DNA is capable of replication either as an autono- 
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OUgonucleotide" is substantially equivalent to the terms 10 mously replicating plasmid or as Part^ of the hos^t 
"amplimer/' "primer,** "oligomer," and "probe," as these ' " ' 

terms are commonly defined in the art, 

"Peptide nucleic acid" (PNA) refers to an antisense mol- 
ecule or anti-gene agent which comprises an oligonucleotide 
of at least about 5 nucleotides in length linked to a peptide 
backbone of amino acid residues ending in lysine. The 
terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or 
RNA and stop transcript elongation, and may be pegylated 
to extend their lifespan in the cell. 

The term "sample" is used in its broadest sense. A sample 
suspected of containing nucleic acids encoding HUCYP, or 
fragments thereof, or HUCYP itself, may comprise a bodily 
fluid; an extract from a cell, chromosome, organelle, or 
membrane isolated fi-om a cell; a cell; genomic DNA, RNA, 
or cDNA, in solution or bound to a substrate; a tissue; a 
tissue print; etc. 

The terms "specific binding" or "specifically binding" 
refer to that interaction between a protein or peptide and an 
agonist, an antibody, or an antagonist. The interaction is 
dependent upon the presence of a particular structure of the 
protein, e.g., the antigenic determinant or epitope, recog- 
nized by the binding molecule. For example, if an antibody 
is specific for epitope "A," the presence of a polypeptide 
containing the epitope A, or the presence of free unlabeled 
A, in a reaction containing free labeled A and the antibody 
will reduce the amount of labeled A that binds to the 
antibody. 

The term "stringent conditions" refers to conditions which 
permit hybridization between polynucleotides and the 
claimed polynucleotides. Stringent conditions can be 
defined by salt concentration, the concentration of organic 
solvent, e.g., formamide, temperature, and other conditions 
well known in the art. In particular, stringency can be 
increased by reducing the concentration of salt, increasing 
the concentration of formamide, or raising the hybridization 
temperature. 

The term "substantially purified" refers to nucleic acid or 
amino acid sequences that are removed from their natural 
environment and are isolated or separated, and are at least 
about 60% free, preferably about 75% free, and most 
preferably about 90% free from other components with 
which they are naturally associated. 

A "substitution" refers to the replacement of one or more 
amino acids or nucleotides by different amino acids or 
nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid 
support including membranes, filters, chips, slides, wafers, 
fibers, magnetic or nonmagnetic beads, gels, tubing, plates, 
polymers, microparticles and capillaries. The substrate can 
have a variety of surface forms, such as wells, trenches, pins, 
channels and pores, to which polynucleotides or polypep- 
tides are bound. 

"Transformation" describes a process by which exog- 
enous DNA enters and changes a recipient cell. Transfor- 
mation may occur under natural or artificial conditions 



chromosome, as well as transiently transformed cells which 
express the inserted DNA or RNA for limited periods of 
time. 

A "variant" of HUCYP polypeptides refers to an amino 
15 acid sequence that is altered by one or more amino acid 
residues. The variant may have "conservative" changes, 
wherein a substituted amino acid has similar structural or 
chemical properties (e.g., replacement of leucine with 
iso leucine). More rarely, a variant may have "nonconserva- 
20 tive" changes (e.g., replacement of glycine with tryptophan). 
Analogous minor variations may also include amino acid 
deletions or insertions, or both. Guidance in determining 
which amino acid residues may be substituted, inserted, or 
deleted without abolishing biological or immunological 
25 activity may be found using computer programs well known 
in the art, for example, LASERGENE software 
(DNASTAR). 

ITie term "variant," when used in the context of a poly- 
nucleotide sequence, may encompass a polynucleotide 
sequence related to HUCYP. This definition may also 
include, for example, "allelic" (as defined above), "splice," 
"species," or "polymorphic" variants. A splice variant may 
have significant identity to a reference molecule, but will 
generally have a greater or lesser number of polynucleotides 
due to alternate splicing of exons during mRNA processing. 
The corresponding polypeptide may possess additional 
functional domains or an absence of domains. Species 
variants are polynucleotide sequences that vary from one 
species to another. The resulting polypeptides generally will 
have significant amino acid identity relative to each other. A 
polymorphic variant is a variation in the polynucleotide 
sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single 
nucleotide polymorphisms" (SNPs) in which the polynucle- 
otide sequence varies by one base. The presence of SNPs 
may be indicative of, for example, a certain population, a 
disease state, or a propensity for a disease state. 



The Invention 
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The invention is based on the discovery of a new human 
cytochrome P450 (HUCYP), the polynucleotides encoding 
HUCYP, and the use of these compositions for the diagnosis, 
treatment, or prevention of cell proliferative, developmental, 
autoimmune/inflammatory, and metabolic disorders. 

Nucleic acids encoding the HUCYP of the present inven- 
tion were identified in Incyte Clone 991729Hlfi:om the 
colon cDNA library (COLNNOTll) using a computer 
search for nucleotide and/or amino acid sequence align - 
60 ments. A consensus sequence, SEQ ID NO:2, was derived 
from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 2812042H1 (OVARNOTIO), 
4502047H1 (BRAVTXT02), 2100169H1 (BRAITUT02), 
991729R6 and 991729H1 (COLNNOTll), 038338R6 
65 (HUVENOBOl), 1996448H1 (BRSTTUT03), 687353H1 
(UTRSNOT02), 1700412F6 (BLADTUT05), 4538247H1 
(OVARNOT12), and 2381571F6 (ISLTNOTOl). 
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In one embodiment, the invention encompasses a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO:l. as shown in FIGS. lA, IB, IC, ID, and IE. HUCYP 
is 462 amino acids in length and has three potential 
N-glycosylation sites at residues N94, N217, and N246; 
eleven potential casein kinase 2 phosphorylation sites as 
residues T36, S53, S116, S155, S183, T184, T293, T294, 
T442, S449, and S450; and seven potential protein kinase C 
phosphorylation sites at residues S134, T219, T281, T333, 
T336, S449, and S459. PFAM analysis indicates that 
HUCYP resembles P450 proteins from 132 through R46 1. 
PRINTS analysis indicates that HUCYP resembles E-class 
P450 Group I proteins from H62 through L81, from C285 
through G311, from A367 through D391; resembles E-class 
P450 Group II proteins from 0265 through 7^93, from T294 
through G311, and from T377 through E392; resembles 
E-class P450 Group IV proteins from E56 through V79, 
from L372 through D390, and from C409 through V427; 
resembles P450 superfamily proteins from T294 through 
N307; and resembles B-class P450 proteins from 0373 
through R388. HUCYP shares conserved residues with 
cytochrome P450s in the potential heme-iron ligand region 
at F403, S404, G405, C409, and P410. C409 is the potential 
heme-binding residue. Like other etikaryotic cytochrome 
P450s, HUCYP contains a potentially membrane -spanning 
region in its first 15 to 20 residues. SPScan and HMM 
indicate that HUCYP has a potential signal peptide from Ml 
through about A18 or a potential transmembrane sequence 
from Ml through about Y23. A fragment of SEQ ID NO:2 
from about nucleotide 109 to about nucleotide 153 is useful 
in hybridization or amplification technologies to identify 
SEQ ID NO:2 and to distinguish between SEQ ID NO:2 and 
a related sequence. Northern analysis shows the expression 
of this sequence in various Libraries, 72% of which are 
associated with cancer and cell proliferation, 28% of which 
are associated with inflammation and immune response, 
25% of which are reproductive tissues, 19% of which are 
nervous tissues, 19% of which are gastrointestinal tissues, 
and 16% of which are cardiovascular tissues. Of particular 
note is the expression of HUCYP in cancers of the liver, 
colon, uterus, testes, lung, breast, brain, and bladder, and in 
leukemia. 

The invention also encompasses HUCYP variants. A 
preferred HUCYP variant is one which has at least about 
80%, more preferably at least about 90%, and most prefer- 
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ably at least about 95% amino acid sequence identity to the 
HUCYP amino acid sequence, and which contains at least 
one functional or structural characteristic of HUCYP. 

The invention also encompasses polynucleotides which 
encode HUCYP. In a particular embodiment, the invention 
encompasses a polynucleotide sequence comprising the 
sequence of SEQ ID NO:2, which encodes HUCYP. 

The invention also encompasses a variant of a polynucle- 
otide sequence encoding HUCYP. In particular, such a 
variant polynucleotide sequence will have at least about 
70%, more preferably at least about 85%, and most prefer- 
ably at least about 95% polynucleotide sequence identity to 
the polynucleotide sequence encoding HUCYP. A particular 
aspect of the invention encompasses a variant of SEQ ID 
NO: 2 which has at least about 70%, more preferably at least 
about 85%, and most preferably at least about 95% poly- 
nucleotide sequence identity to SEQ ID NO;2, Any one of 
the polynucleotide variants described above can encode an 
amino acid sequence which contains at least one functional 
or structural characteristic of HUCYP. 

It will be appreciated by those skilled in the art that as a 
result of the degeneracy of the genetic code, a multitude of 



polynucleotide sequences encoding HUCYP, some bearing 
minimal similarity to the polynucleotide sequences of any 
known and naturally occurring gene, may be produced. 
Thus, the invention contemplates each and every possible 
variation of polynucleotide sequence that could be made by 
selecting combinations based on possible codon choices. 
These combinations are made in accordance with the stan- 
dard triplet genetic code as applied to the polynucleotide 
sequence of naturally occurring HUCYP, and all such varia- 
tions are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode HUCYP 
and its variants are preferably capable of hybridizing to the 
nucleotide sequence of the naturally occurring HUCYP 
under appropriately selected conditions of stringency, it may 
be advantageous to produce nucleotide sequences encoding 
HUCYP or its derivatives possessing a substantially differ- 
ent codon tisage, e.g., inclusion of non-naturally occurring 
codons. Codons may be selected to increase the rale at which 
expression of the peptide occurs in a particular prokaryotic 
or eukaryotic host in accordance with the frequency with 
which particular codons are utilized by the host. Other 
reasons for substantially altering the nucleotide sequence 
encoding HUCYP and its derivatives without altering the 
encoded amino acid sequences include the production of 
RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally 
occurring sequence. 

The invention also encompasses production of DNA 
sequences which encode HUCYP and HUCYP derivatives, 
or fragments thereof, entirely by synthetic chemistry. After 
production, the synthetic sequence may be inserted into any 
of the many available expression vectors and cell systems 
using reagents well known in the art. Moreover, synthetic 
chemistry may be used to introduce mutations into a 
sequence encoding HUCYP or any fragment thereof. 

Also encompassed by the invention are polynucleotide 
sequences that are capable of hybridizing to the claimed 
polynucleotide sequences, and, in particular, to those shown 
in SEQ ID NO:2, or to a fragment of SEQ ID NO:2, under 
various conditions of stringency. (See, e.g., Wahl, G. M. and 
S. L. Berger (1987) Methods Enzymol. 152:399^07; 
Kimmel, A. R. (1987) Methods Enzymol. 152:507-511.) For 
example, stringent salt concentration will ordinarily be less 
than about 750 mM NaCl and 75 mM trisodium citrate, 
45 preferably less than about 500 mM NaCl and 50 mM 
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trisodium citrate, and most preferably less than about 250 
mM NaCl and 25 mM trisodium citrate. Low stringency 
hybridization can be obtained in the absence of organic 
solvent, e.g., formamide, while high stringency hybridiza- 
tion can be obtained in the presence of at least about 35% 
formamide, and most preferably at least about 50% forma- 
mide. Stringent temperature conditions will ordinarily 
include temperatures of at least about 30** C, more prefer- 
ably of at least about 37** C, and most preferably of at least 
about 42° C, Varying additional parameters, such as hybrid- 
ization time, the concentration of detergent, e.g., sodium 
dodecyl sulfate (SDS), and the inclusion or exclusion of 
carrier DNA, are well known to those skilled in the art. 
Various levels of stringency are accomplished by combining 
these various conditions as needed. In a preferred 
embodiment, hybridization will occur at 30° C. in 750 mM 
NaCl, 75 mM trisodium citrate, and 1% SDS. In a more 
preferred embodiment, hybridization will occur at 37° C. in 
500 mM NaCl, 50 mM trisodium citrate, 1% SDS, 35% 
formamide, and 100 /ig/ml denatured salmon sperm DNA 
(ssDNA). In a most preferred embodiment, hybridization 
will occur at 42° C. in 250 mM NaCl, 25 mM trisodium 
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citrate, 1% SDS, 50% formamide, and 200 //g/ml ssDNA. 
Useful variations on these conditions will be readily appar- 
ent to those skilled in the art. 

The washing steps which follow hybridization can also 
vary in stringency. Wash stringency conditions can be 
defined by salt concentration and by temperature. As above, 
wash stringency can be increased by decreasing salt con- 
centration or by increasing temperature. For example, strin- 
gent salt concentration for the wash steps wiU preferably be 
less than about 30 mM NaCl and 3 mM trisodium citrate, 
and most preferably less than about 15 mM NaCl and 1.5 
mM trisodium citrate. Stringent temperature conditions for 
the wash steps will ordinarily include temperature of at least 
about 25*^ C, more preferably of at least about 4T' C, and 
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used to insert an engineered double-stranded sequence into 
a region of unknown sequence before performing PGR. 
Other methods which may be used to retrieve unknown 
sequences are known in the art. (See, e.g., Parker, J. D. et al. 
(1991) Nucleic Acids Res. 19:3055-306). Additionally, one 
may use PGR, nested primers, and PROMOTERFINDER 
libraries (Clontech, Palo Alto Calif.) to walk genomic DNA. 
This procedure avoids the need to screen libraries and is 
useful in finding intron/exon junctions. For all PGR-based 
methods, primers may be designed using commercially 
available software, such as OLIGO 4.06 primer analysis 
software (National Biosciences, Plymouth Minn.) or another 
appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to 



about 25*^ G., more preferably ot at least aooui v.., auu lengin, lo nave a vj^ cuul^ui ^^^-^ 7 
most preferably of at least about 68° G. In a preferred 15 anneal to the template at temperatures of about 68 C. to /Z 
»«,u^^;«^^«t wacVi ctp.nc will occur at 25** C. in 30 mM C. 
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embodiment, wash steps will occur at 25* C. in 30 mM 
NaCl, 3 mM trisodium citrate, and 0.1% SDS. In a more 
preferred embodiment, wash steps will occur at 42° C. in 15 
mM Naa, 1.5 mM trisodium citrate, and 0.1% SDS. In a 
most preferred embodiment, wash steps will occur at 68° C. 
in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. 
Additional variations on these conditions will be readily 
apparent to those sktQed in the art. 

Methods for DNA sequencing are well known in the art 
and may be used to practice any of the embodiments of the 25 
invention. The methods may employ such enzymes as the 
Klenow fragment of DNA polymerase I, SEQUENASE (US 
Biochemical, Cleveland Ohio), Taq polymerase (Perkin- 
Elmer), thermostable T7 polymerase (Amersham Pharmacia 



When screening for full-length cDNAs, it is preferable lo 
use libraries that have been size-selected to include larger 
cDNAs. In addition, random-primed libraries, which often 
include sequences containing the 5' regions of genes, are 
preferable for situations in which an oligo d(T) library does 
not yield a full-length cDNA. Genomic libraries may be 
useful for extension of sequence into 5' non- transcribed 
regulatory regions. 

Capillary electrophoresis systems which are commer- 
cially available may be used to analyze the size or confirm 
the nucleotide sequence of sequencing or PGR products. In 
particular, capillary sequencing may employ flowable poly- 
mers for electrophoretic separation, four different 



Elmer), thermostable 1/ polymerase i^/\mersnani i'iiaima,..ia ^^^^ electrophoretic separation, lour umcici 

Biotech, Piscataway N.J.), or combinations of polymerases 30 nucleotide-specific, laser-stimulated fluorescent dyes, and 
r.nH r.rnr.fmaflino exonucleascs such as those found in the ^u^^rr^ ^r^nr^ipH Hp.vire camera for detection of the emitte 



and proofreading exonucleases such as those found in the 
ELONGASE amplification system (Life Technologies, 
Gaithersburg Md.). Preferably, sequence preparation is auto- 
mated with machines such as the Robbins Hydra microdis- 
penser (Robbins Scientific, Sunnyvale Calif), Hamilton 35 
MIGROLAB 2200 (HamUton, Reno Nev.), Peltier Thermal 
Cycler 200 (PTG200; M J Research, Watertown Mass.) and 
the ABI CATALYST 800 (Perkin -Elmer). Sequencing is 
then carried out using the ABI 373 or 377 DNA sequencing 
systems (Perkin -Elmer), the MEGABACE 1000 DNA 40 
sequencing system (Molecular Dynamics, Sunnyvale 
CaHf.), or other systems known in the art. The resulting 
sequences are analyzed using a variety of algorithms which 
are well known in the art. (See, e.g., Ausubel, F. M. (1997) 
Short Protocols in Molecular Biology, John Wiley & Sons, 45 
New York N.Y, unit 7.7; Meyers, R. A. (1995) Molecular 
Biology and Biotechnology, Wiley VCH, New York N.Y., 
pp. 856-853.) 

The nucleic acid sequences encoding HUCYP may be 
extended utilizing a partial nucleotide sequence and employ- 50 
ing various PCR-based methods known in the art to detect 
upstream sequences, such as promoters and regulatory ele- 
ments. For example, one method which may be employed, 
restriction-site PGR, uses universal and nested prirners to 



charge coupled device camera for detection of the emitted 
wavelengths. Output/light intensity may be converted to 
electrical signal using appropriate software (e.g., GENO- 
TYPER and SEQUENCE NAVIGATOR, Perkin-Ehner), 
and the entire process from loading of samples to computer 
analysis and electronic data display may be computer con- 
trolled. Capillary electrophoresis is especially preferable for 
sequencing small DNA fragments which may be present in 
limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide 
sequences or fragments thereof which encode HUCYP may 
be cloned in recombinant DNA molecules that direct expres- 
sion of HUCYP, or fragments or functional equivalents 
thereof, in appropriate host cells. Due to the inherent degen- 
eracy of the genetic code, other DNA sequences which 
encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express 
HUCYP 

llie nucleotide sequences of the present invention can be 
engineered using methods generally known in the art in 
order to alter HUCYP-en coding sequences for a variety of 
purposes including, but not limited to, modification of the 
cloning, processing, and/or expression of the gene product. 



restriction-site PGR, uses universal and "^^^^^^f DNA shuffling by random fragmentation and PGR reassem 

amplify unknown sequence ge^o^^i^,^^ ^^^^^ blv of sene fragments and synthetic oligonucleotides may b. 



bly of gene fragments and synthetic oligonucleotides may be 
used to engineer the nucleotide sequences. For example, 
oUgonucleotide-mediated site-directed mutagenesis may be 
used to introduce mutations that create new restriction sites, 
60 alter glycosylation patterns, change codon preference, pro- 
duce splice variants, and so forth. 

In another embodiment, sequences encoding HUCYP 
may be synthesized, in whole or in part, using chemical 

fragments adjacent to known sequences m numan ana yc^. "^^^°to::f^ m Ta "n^^f Svmn l!r n^S^lT.nei 

artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 65 et al. (1980) Nucl. Acids Res. Symp. ^er- (7). 215 223 and 

PGR S 1:111-119 ) In this method, Horn, T et al. (1980) NucL Acids Symp. Ser (7): 225-232.) 

Slti\e SL^^^^^ digestions and ligations may be Alternatively, HUCYP itself or a fragment thereof may be 



cloning vector. (See, e.g., Sarkar, G. (1993) PGR Methods 
Applic. 2:318-322.) Another method, inverse PGR, uses 
primers that extend in divergent directions to amplify 
unknown sequence from a circularized template. The tem- 
plate is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., 
Triglia, T et al. (1988) Nucleic Acids Res. 16:8186.) A third 
method, capture PGR, involves PGR amplification of DNA 
fragments adjacent to known sequences in human and yeast 



6,130,077 



15 



16 



15 



20 



25 



synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solid-phase tech- 
niques. (See, e.g., Roberge, J. Y. et al. (1995) Science 
269:202-204.) Automated synthesis may be achieved using 
the ABI 431A peptide synthesizer (Perkin-Elmer). 
Additionally, the amino acid sequence of HUCYP, or any 
part thereof, may be altered during direct synthesis and/or 
combined with sequences from other proteins, or any part 
thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative 
high performance liquid chromatography. (See, e.g, Chiez, 
R. M. and F. Z. Regnier (1990) Methods Enzynaol. 
182:392^21.) The composition of the synthetic peptides 
may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, T. (1984) Proteins. Structures and 
Molecular Properties, W H Freeman, New York N.Y.) 

In order to express a biologically active HUCYP, the 
nucleotide sequences encoding HUCYP or derivatives 
thereof may be inserted into an appropriate expression 
vector, i.e., a vector which contains the necessary elements 
for transcriptional and translational control of the inserted 
coding sequence in a suitable host. These elements include 
regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3' untranslated regions in the 
vector and in polynucleotide sequences encoding HUCYP. 
Such elements may vary in their strength and specificity. 
Specific initiation signals may also be used to achieve more 
efBcient translation of sequences encoding HUCYP. Such 
signals include the ATG initiation codon and adjacent 
sequences, e.g. the Kozak sequence. In cases where 
sequences encoding HUCYP and its initiation codon and 
upstream regulatory sequences are inserted into the appro- 
priate expression vector, no additional transcriptional or 
translational control signals may be needed. However, in 
cases where only coding sequence, or a fragment thereof, is 
inserted, exogenous translational control signals including 
an in-frame ATG initiation codon should be provided by the 
vector. Exogenous translational elements and initiation 
codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclu- 
sion of enhancers appropriate for the particular host cell 
system used. (See, e.g., Scharf, D. et al. (1994) Results 
Probl. Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art 
may be used to construct expression vectors containing 
sequences encoding HUCYP and appropriate transcriptional 45 
and translational control elements. These methods include in 
vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination. (See, e.g., Sambrook, J- 
et al. (1989) Molecular Cloning, A Laboratory Manual, 
Cold Spring Harbor Press, Plainview N.Y., ch. 4, 8, and 
16-17; Ausubel, F. M. et al. (1995) Current Protocols in 
Molecular Biology, John Wiley & Sons, New York N.Y., ch. 
9, 13, and 16.) 

A variety of expression vector/host systems may be 
utilized to contain and express sequences encoding HUCYP. 
These include, but are not limited to, microorganisms such 
as bacteria transformed with recombinant bacteriophage, 
plasmid, or cosmid DNA expression vectors; yeast trans- 
formed with yeast expression vectors; insect cell systems 
infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors 
(e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic 
virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. ITie invention is 
not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression 
vectors may be selected depending upon the use intended for 



30 



35 



40 



50 



55 



60 



65 



polynucleotide sequences encoding HUCYP. For example, 
routine cloning, subcloning, and- propagation of polynucle- 
otide sequences encoding HUCYP can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT 
(Stratagene, La Jolla Calif.) or pSPORTl plasmid (Life 
Technologies). Ligation of sequences encoding HUCYP into 
the vector's multiple cloning site disrupts the lacZ gene, 
allowing a colorimetric screening procedure for identifica- 
tion of transformed bacteria containing recombinant niol- 
ecules. In addition, these vectors may be useful for in vitro 
transcription, dideoxy sequencing, single strand rescue with 
helper phage, and creation of nested deletions in the cloned 
sequence. (See, e.g.. Van Heeke, G. and S. M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509.) When large quan- 
tities of HUCYP are needed, e.g. for the production of 
antibodies, vectors which direct high level expression of 
HUCYP may be used. For example, vectors containing the 
strong, inducible T5 or T7 bacteriophage promoter may be 
used. 

Yeast expression systems may be used for production of 
HUCYP- A number of vectors containing constitutive or 
inducible promoters, such as alpha factor, alcohol oxidase, 
and PGH promoters, may be used in the yeast Saccharomy- 
ces cerevisiae or Pichia pastoris. In addition, such vectors 
direct either the secretion or intracellular retention of 
expressed proteins and enable integration of foreign 
sequences into the host genome for stable propagation. (See, 
e.g., Ausubel, 1995, supra; Grant et al. (1987) Methods 
Enzymol. 153:516-54; and Scorer, C. A. et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of HUCYP. 
Transcription of sequences encoding HUCYP may be driven 
by viral promoters, e.g., the 35S and 19S promoters of 
CaMV used alone or in combination with the omega leader 
sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-311). Alternatively, plant promoters such as the small 
subunit of RUBISCO or heat shock promoters may be used. 
(See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; 
Broglie, R. et al. (1984) Science 224:838-843; and Winter, 
J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) These 
constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. (See, 
e.g.. The McGrawHill Yearbook of Science and Technology 
(1992) McGraw HiU, New York N.Y, pp. 191-196.) 

In mammalian cells, a number of viral-based expression 
systems may be utilized. In cases where an adenovirus is 
used as an expression vector, sequences encoding HUCYP 
may be ligated into an adenovirus transcription/translation 
complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the 
viral genome may be used to obtain infective virus which 
expresses HUCYP in host cells. (See, e.g., Logan, J. and T. 
Shenk (1984) Proc, Natl. Acad. Sci. 81:3655-3659.) In 
addition, transcription enhancers, such as the Rous sarconia 
virus (RSV) enhancer, may be used to increase expression in 
mammafian host ceUs. SV40 or EBV-based vectors may also 
be \ised for high-level protein expression. 

Human artificial chromosomes (HACs) may also be 
employed to deliver larger fragments of DNA than can be 
contained in and expressed from a plasmid. HACs of about 
6 kb to 10 Mb are constructed and delivered via conven- 
tional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., 
Harrington, J. J. et al. (1997) Nat Genet. 15:345-355.) 

For long term production of recombinant proteins in 
mammalian systems, stable expression of HUCYP in cell 
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lines is preferred. For example, sequences encoding HUCYP 
can be transformed into cell lines using expression vectors 
which may contain viral origins of replication and/or endog- 
enous expression elements and a selectable marker gene on 
the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for about 1 to 2 
days in enriched media before being switched to selective 
media. The purpose of the selectable marker is to confer 
resistance to a selective agent, and its presence allows 



6,130,077 

18 

and other assays are well known in the art. (See, e.g., 
Hampton, R. et al. (1990) Serological Methods, a Labora- 
tory Manual, APS Press, St Paul Minn., Sect. TV; Cohgan, 
J. E. et al. (1997) Current Protocols in Immunology, Greene 
Pub. Associates and Wiley-Interscience, New York N.Y.; 
and Pound, J. D. (1998) Immunochemical Protocols, 
Humana Press, Totowa N.J.). 

A wide variety of labels and conjugation techniques are 
known by those skiUed in the art and may be used in various 
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laucicu hybridization or PGR probes for detecting sequences 
related to polynucleotides encoding HUCYP include 
oligolabeling, nick translation, end-labeling, or PGR ampli- 
fication using a labeled nucleotide. Alternatively, the 
sequences encoding HUCYP, or any fragments thereof, may 
be cloned into a vector for the production of an mRNA 
probe. Such vectors are known in the art, are commercially 
available, and may be used to synthesize RNA probes in 
vitro by addition of an appropriate RNA polymerase such as 
T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available 
kits, such as those provided by Amersham Pharmacia 
Biotech, Promega (Madison Wis.), and US Biochemical. 
Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents, as 
well as substrates, cofactors, inhibitors, magnetic particles, 
and the like. 

Host cells transformed with nucleotide sequences encod- 
30 ing HUCYP may be cultured under conditions suitable for 
the expression and recovery of the protein from cell culture. 
The protein produced by a transformed cell may be secreted 
or retained intracellularly depending on the sequence and/or 
the vector used. As will be understood by those of skill in the 
35 art, expression vectors containing polynucleotides which 
encode HUCYP may be designed to contain signal 
sequences which direct secretion of HUCYP through a 
prokaryotic or eukaryotic cell membrane. 
Although Ihe presence/absence ot marKer gene In addition, a host cell strain may be chosen for its ability 

su£Satthe^geneofinterestisalsopresent,thepresence 40 to modulate expression of the mserted -^Jl^^^^^ sJh 
and Spression of the gene may need to be confirmed. For process the expressed protein m the desired fashion Such 
exlX^ the sequence encoding HUCYP is inserted modifications of the polypeptide include, but are no hmited 
wTto a marker gene sequence, transformed cells containing to, acetylation. carboxylation, glycosyla ion 
Il<^^enc^ encoding HUCYP can be identified by the phosphorylation, lipidation, and acylation. Post-translational 
rblLe of r^aTker g^ function. Alternatively, a marker 45 processing which cleaves a "prepro" fo™ of the prote.^^^^^ 
aoscntc "| „ „„„^„^ enf^odinsr he used to soecify protein targeting, folding, and/or 



formed cells may be propagated using tissue culture tech 
niques appropriate to the cell type. 

Any number of selection systems may be used to recover 
transformed cell lines. These include, but are not limited to, 
the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferasc genes, for use in tk" or apr cells, 
respectively. (See, e.g., Wigler, M. et al. (1977) Cell 
11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, 
antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers 
resistance to methotrexate; neo confers resistance to the 
aminoglycosides, neomycin and G-418; and als or pat confer 
resistance to chlorsulfuron and phosphinotricin 
acetyltransferase, respectively. (See, e.g., Wigler, M. el al. 
(1980) Proc. Natl. Acad. Sci. 77:3567-3570; Colbere- 
Garapin, R et al. (1981) J. Mol. Biol. 150:1-14.) Additional 
selectable genes have been described, e.g., trpB and hisD, 
which alter cellular requirements for metabolites. (See, e.g., 
Hartman, S. G, and R. G. Mulligan (1988) Proc. Natl. Acad. 
Sci 85:8047-8051.) Visible markers, e.g., anthocyanins, 
green fluorescent proteins (GFP; Clontech), p glucuronidase 
and its substrate p-glucuronide, or luciferase and its sub- 
strate luciferin may be used. These markers can be used not 
only to identify transform ants, but also to quantify the 
amount of transient or stable protein expression attributable 
to a specific vector system. (See, e.g., Rhodes, C. A. (1995) 
Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression 



gene can be placed in tandem with a sequence encoding 
HUCYP under the control of a single promoter. Expression 
of the marker gene in response to induction or selection 
usually indicates expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid 
sequence encoding HUCYP and that express HUCYP may 
be identified by a variety of procedures known to those of 
skill in the art. These procedures include, but are not limited 
to, DNA — DNA or DNA — RNA hybridizations, PGR 



also be used to specify protein targeting, folding, and/or 
activity. Different host cells which have specific cellular 
machinery and characteristic mechanisms for post- 
translational activities (e.g., GHO, HeLa, MDGK, HEK293, 
50 and WI38), are available from the American Type Culture 
Collection (ATCC, Bethesda Md.) and may be chosen to 
ensure the correct modification and processing of the foreign 
protein. 

In another embodiment of the invention, natural. 
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amnHfication and protein bioassay or immunoassay tech- 55 modified, or recombinant nucleic acid sequences encoding 
ampuncaiion, anu pruicm uiua» y j i;„atf>H to a heferoloirous seauence resulting 



niques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of 
nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the 
expression of HUCYP using either specific polyclonal or 
monoclonal antibodies are known in the art. Examples of 
such techniques include enzyme -linked immunosorbent 
assays (ELISAs), radioimmunoassays (RIAs), and fluores- 
cence activated cell sorting (FACS). A two-site, monoclonal- 
based immunoassay utilizing monoclonal antibodies reac- 
tive to two non-interfering epitopes on HUCYP is preferred, 
but a competitive binding assay may be employed. These 



HUCYP may be ligated to a heterologous sequence resulting 
in translation of a fusion protein in any of the aforemen- 
tioned host systems. For example, a chimeric HUCYP 
protein containing a heterologous moiety that can be recog- 
60 nized by a commercially avaUable antibody may facilitate 
the screening of peptide libraries for inhibitors of HUCYP 
activity. Heterologous protein and peptide moieties may also 
faciUtate purification of fusion proteins using commercially 
available aflSnity matrices. Such moieties include, but are 
65 not limited to, glutathione S-transferase (GST), maltose 
binding protein (MBP), thioredoxin (Trx), calmodulin bind- 
ing peptide (CBP), 6-IIis, FLAG, c-myc, and hemagglutinin 
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(HA). GST, MBP, Trx, CBP, and 6-His enable purification of 
their cognate fusion proteins on immobiUzed glutathione, 
maltose, phenylarsine oxide, calmoduUn, and metal-chelate 
resins, respectively. FLAG, c-myc, and hemagglutinin (HA) 
enable immunoaffinity purification of fusion proteins using 
commercially available monoclonal and polyclonal antibod- 
ies that specifically recognize these epitope tags. A fusion 
protein may also be engineered to contain a proteolytic 
cleavage site located between the HUCYP encoding 
sequence and the heterologous protein sequence, so that 
HUCYP may be cleaved away from the heterologous moiety 
following purification. Methods for fusion protein expres- 
sion and purification are discussed in Ausubel (1995, supra, 
ch 10). A variety of commercially available kits may also be 
used to facihtate expression and purification of fusion pro- 
teins. 

In a further embodiment of the invention, synthesis of 
radiolabeled HUCYP may be achieved in vitro using the 
TNT rabbit reticulocyte lysate or wheat germ extract sys- 
tems (Promega). These systems couple transcription and 
translation of protein-coding sequences operably associated 
with the T7, T3, or SP6 promoters. Translation takes place 
in the presence of a radiolabeled amino acid precursor, 
preferably ^^S-methionine. 

Fragments of HUCYP may be produced not only by 
recombinant production, but also by direct peptide synthesis 
using soUd-phase techniques. (See, e.g., Creighton, supra 
pp. 55-60.) Protein synthesis may be performed by manual 
techniques or by automation. Automated synthesis may be 
achieved, for example, using the ABI 431 A peptide synthe- 
sizer (Perkin-Ehner). Various fragments of HUCYP may be 
synthesized separately and then combined to produce the 
full length molecule. 

Therapeutics 

Chemical and structural similarity, e.g., in the context of 
sequences and motifs, exists between regions of HUCYP 
and cytochrome P450 proteins. In addition, the expression of 
HUCYP is closely associated with cancer and cell 
proUferation, inflammation and immune response, reproduc- 
tive tissues, nervous tissues, gastrointestinal tissues, and 
cardiovascular tissues. Therefore, HUCYP appears to play a 
role in cell proHferative, developmental, autoimmune/ 
inflammatory, and metabolic disorders. In the treatment of 
disorders associated with increased HUCYP expression or 
activity, it is desirable to decrease the expression or activity 
of HUCYP. In the treatment of disorders associated with 
decreased HUCYP expression or activity, it is desirable to 
increase the expression or activity of HUCYP. 

Therefore, in one embodiment, HUCYP or a fragment or 
derivative thereof may be administered to a subject to treat 
or prevent a disorder associated with decreased expression 
or activity of HUCYP. Examples of such disorders mclude, 
but are not Umited to, a cell proHferative disorder such as 
actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia, and cancers including adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma 
teratocarcinoma, and, in particular, cancers of the adrenal 
gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 
salivary glands, skin, spleen, testis, thymus, thyroid, and 
uterus; a developmental disorder such as renal tubular 
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acidosis, anemia, Cushing's syndrome, achondroplastic 
dwarfism, Duchenne and Becker muscular dystrophy, 
epilepsy, gonadal dysgenesis, WAGR syndrome (Wilms 
tumor aniridia, genitourinary abnormalities, and mental 
retardation), Smith-Magenis syndrome, myelodysplastic 
syndrome, hereditary mucoepithelial dysplasia, hereditary 
keratodermas, hereditary neuropathies such as Charcot- 
Marie-Tooth disease and neurofibromatosis, 
hypothyroidism, hydrocephalus, seizure disorders such as 
Syndenham's chorea and cerebral palsy, spina bifida, 
anencephaly, craniorachischisis, congenital glaucoma 
cataract, and sensorineural hearing loss; an autoimmune/ 
inflammatory disorder such as acquired immunodeficiency 
syndrome (AIDS), Addison's disease, adult respiratory dis- 
tress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, asthma, atherosclerosis, autounmune 
hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyenodocrinopathy-candidiasis-ectodermal dystrophy 
(APECED), bronchitis, cholecystitis, contact dermatitis, 
Crohn's disease, atopic dermatitis, dermato myositis, diabe- 
tes mellitus, emphysema, episodic lymphopenia with 
lymphocytotoxins, erythroblastosis fetalis, erythema 
nodosum, atrophic gastritis, glomerulonephritis, Goodpas- 
ture's syndrome, gout, Graves' disease, Hashimoto s 
thyroiditis, hypereosinophilia, irritable bowel syndrome, 
multiple sclerosis, myasthenia gravis, myocardial or peri- 
cardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reitcr's syndrome, 
rheumatoid arthritis, scleroderma, Sjogren's syndrome, sys- 
temic anaphylaxis, systemic lupus erythematosus, systemic 
sclerosis, thrombocytopenic purpura, ulcerative cohtis, 
uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and helminthic 
infections, and trauma; and a metaboUc disorder such as 
Addison's disease, cystic fibrosis, diabetes, fatty 
hepatocirrhosis, galactosemia, goiter, hyperadrenalism, 
hypoadrenalism, hyperparathyroidism, hypoparathyroidism, 
hypercholesterolemia, hyperthyroidism, hypothyroidism 
hyperlipidemia, hyperiipemia, lipid myopathies, obesity 
lipodystrophies, and phenylketonuria, congenital adrenal 
hyperplasia, pseudovitamin D-deficiency rickets, cerebro- 
tendinous xanthomatosis, and coumarin resistance. 

In another embodiment, a vector capable of expressing 
HUCYP or a fragment or derivative thereof may be admin- 
istered to a subject to treat or prevent a disorder associated 
with decreased expression or activity of HUCYP including, 
but not limited to, those described above- 
In a further embodiment, a pharmaceutical composition 
comprising a substantiaUy purified HUCYP in conjunction 
with a suitable pharmaceutical carrier may be administered 
to a subject to treat or prevent a disorder associated with 
decreased expression or activity of HUCYP including, but 
not limited to, those provided above. 

In still another embodiment, an agonist which modulates 
the activity of HUCYP may be administered to a subject to 
treat or prevent a disorder associated with decreased expres- 
sion or activity of HUCYP including, but not limited to, 
those listed above. 

In a further embodiment, an antagonist of HUCYP may be 
administered to a subject to treat or prevent a disorder 
associated with increased expression or activity of HUCYP. 
Such disorders may include, but are not limited to, cell 
proliferative, developmental, autoimmune/inflammatory, 
and metabolic disorders as listed above. In one aspect, an 
antibody which specifically binds HUCYP may be used 
directly as an antagonist or indirectly as a targeting or 
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delivery mechanism for bringing a pharmaceutical agent to 
cells or tissue which express HUCYR 

In an additional embodiment* a vector expressing the 
complement of the polynucleotide encoding HUCYP may 
be administered to a subject to treat or prevent a disorder 
associated with increased expression or activity of HUCYP 
including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, 
antibodies, agonists, complementary sequences, or vectors 
of the invention may be administered in combination with 
other appropriate therapeutic agents. Selection of the appro- 
priate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional 
pharmaceutical principles. The combination of therapeutic 
agents may act synergistically to effect the treatment or 
prevention of the various disorders described above. Using 
this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the 
potential for adverse side effects. 

An antagonist of HUCYP may be produced using meth- 
ods which are generally known in the art. In particular, 
purified HUCYP may be used to produce antibodies or to 
screen libraries of pharmaceutical agents to identify those 
which specifically bind HUCYP. Antibodies to HUCYP may 
also be generated using methods that are well known in the 
art. Such antibodies may include, but are not limited to, 
polyclonal, monoclonal, chimeric, and single chain 
antibodies. Fab fragments, and fragments produced by a Fab 
expression library. Ncutrahzing antibodies (i.e., those which 
inhibit dimer formation) are especially preferred for thera- 
peutic use. 

For the production of antibodies, various hosts including 
goats, rabbits, rats, mice, humans, and others may be immu- 
nized by injection with HUCYP or with any fragment or 
oligopeptide thereof which has immunogenic properties. 
Depending on the host species, various adjuvants may be 
used to increase immunological response. Such adjuvants 
include, but are not limited to, Freund^s, mineral gels such 
as aluminum hydroxide, and surface active substances such 
as iysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, KLH, and dinitrophenol. Among adjuvants used 
in humans, BCG (bacilli Calmette-Guerin) and Corynehac- 
terium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or frag- 
ments used to induce antibodies to HUCYP have an amino 
acid sequence consisting of at least about 5 amino acids, and, 
more preferably, of at least about 10 amino acids. It is also 
preferable that these oligopeptides, peptides, or fragments 
are identical to a portion of the amino acid sequence of the 
natural protein and contain the entire amino acid sequence of 
a small, naturally occurring molecule. Short stretches of 
HUCYP amino acids may be fused with those of another 
protein, such as KLH, and antibodies to the chimeric mol- 
ecule may be produced. 

Monoclonal antibodies to HUCYP may be prepared using 
any technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include, 
but are not limited to, the hybridoma technique, the human 
B-ceil hybridoma technique, and the EBV-hybridoma tech- 
nique. (See, e.g., Kohler, G. et al. (1975) Nature 
256:495^97; Kozbor, D. et al. (1985) J. Immunol. Methods 
81:31^2; Cote, R. J. et al. (1983) Proc. Natl. Acad. Sci. 
80:2026-2030; and Cole, S. P. et al. (1984) Mol. Cell Biol. 
62:109-120.) 

In addition, techniques developed for the production of 
"chimeric antibodies," such as the splicing of mouse anti- 



50,077 

22 

body genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity, 
can be used. (See, e.g., Morrison, S. L. et al. (1984) Proc. 
Natl. Acad. Sci. 81:6851-6855; Neuberger, M. S. et al. 
5 (1984) Nature 312:604-608; and Takeda, S. et al. (1985) 
Nature 314:452-^54.) Alternatively, techniques described 
for the production of single chain antibodies may be 
adapted, using methods known in the art, to produce 
HUCYP-specific single chain antibodies. Antibodies with 

no related specificity, but of distinct idiotypic composition, may 
be generated by chain shuffling from random combinatorial 
immunoglobulin libraries. (See, e.g.. Burton D. R. (1991) 
Proc. Natl. Acad. Sci. 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo 

15 production in the lymphocyte population or by screening 
immunoglobulin libraries or panels of highly specific bind- 
ing reagents as disclosed in the literature. (See, e.g., Orlandi, 
R. et al. (1989) Proc. Natl. Acad. Sci. 86:3833-3837; Winter, 
G. et al. (1991) NaUire 349:293-299.) 

Antibody fragments which contain specific binding sites 
for HUCYP may also be generated. For example, such 
fragments include, but are not limited to, F(ab*)2 fragments 
produced by pepsin digestion of the antibody molecule and 
Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries 
may be constructed to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. (See, 
e.g., Huse, W. D. et al. (1989) Science 246:1275-1281.) 

2P Various immunoassays may be used for screening to 
identify antibodies having the desired specificity. Numerous 
protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies 
with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of com- 
plex formation between HUCYP and its specific antibody. A 
two-site, monoclonal-based immunoassay utilizing mono- 
clonal antibodies reactive to two non -interfering HUCYP 
epitopes is preferred, but a competitive binding assay may 
also be employed (Pound, supra). 

Various methods such as Scatchard analysis in conjunc- 
tion with radioimmunoassay techniques may be used to 
assess the aflSnity of antibodies for HUCYP. Affinity is 
expressed as an association constant, K^, which is defined as 

45 the molar concentration of HUCYP-antibody complex 
divided by the molar concentrations of free antigen and free 
antibody under equilibrium conditions. The determined 
for a preparation of polyclonal antibodies, which are het- 
erogeneous in their afiSnities for multiple HUCYP epitopes, 

50 represents the average affinity, or avidity, of the antibodies 
for HUCYP. The determined for a preparation of mono- 
clonal antibodies, which are monospecific for a particular 
HUCYP epitope, represents a true measure of affinity. High- 
affinity antibody preparations with ranging from about 

55 lO^ to 10^^ 1/mole are preferred for use in immunoassays in 
which the HUCYP-antibody complex must withstand rig- 
orous manipulations. Low-affinity antibody preparations 
with ranging from about 10^ to 10^ 1/mole are preferred 
for use in immunopurlfication and sunilar procedures which 

60 ultimately require dissociation of HUCYP, preferably in 
active form, from the antibody (Catty, D. (19SS) A ntibodieSj 
Volume I: A Practical Approach, IRL Press, Washington 
D.C.; Liddell, J. E. and Cryer, A. (1991) A Practical Guide 
to Monoclonal Antibodies, John Wiley & Sons, New York 

65 N.Y.). 

The titer and avidity of polyclonal antibody preparations 
may be further evaluated to determine the quality and 
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suitability of such preparations for certain downstream 
applications. For example, a polyclonal antibody prepara- 
tion containing at least 1-2 mg specific antibody/ml, pref- 
erably 5-10 mg specific antibody/ml, is preferred for use in 
procedures requiring precipitation of HUCYP-antibody 5 
complexes. Procedures for evaluating antibody specificity, 
titer, and avidity, and guidelines for antibody quality and 
usage in various applications, are generally available. (See, 
e.g.. Catty, supra, and Coligan et al. supra.) 

In another embodiment of the invention, the polynucle- ^0 
otides encoding HUCYP, or any fragment or complement 
thereof, may be used for therapeutic purposes. In one aspect, 
the complement of the polynucleotide encoding HUCYP 
may be used in situations in which it would be desirable to 
block the transcription of the mRNA. In particular, cells may ^5 
be transformed with sequences complementary to poly- 
nucleotides encoding HUCYP. Thus, complementary mol- 
ecules or fragments may be used to modulate HUCYP 
activity, or to achieve regulation of gene function. Such 
technology is now well known in the art, and sense or 2o 
antisense oligonucleotides or larger fragments can be 
designed from various locations along the coding or control 
regions of sequences encoding HUCYP. 

Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various 
bacterial plasmids, may be used for delivery of nucleotide 
sequences to the targeted organ, tissue, or cell population. 
Methods which are well known to those skilled in the art can 
be used to construct vectors to express nucleic acid 
sequences complementary to the polynucleotides encoding 
HUCYP. (See, e.g., Sambrook, supra; Ausubel, 1995, supra.) 

Genes encoding HUCYP can be turned off by transform- 
ing a cell or tissue with expression vectors which express 
high levels of a polynucleotide, or fragment thereof, encod- 
ing HUCYP. Such constructs may be used to introduce 
untranslatable sense or antisense sequences into a cell. Even 
in the absence of integration into the DNA, such vectors may 
continue to transcribe RNA molecules until they are disabled 
by endogenous nucleases. Transient expres.sion may last for ^ 
a month or more with a non-repUcating vector, and may last 
even longer if appropriate replication elements are part of 
the vector system. 

As mentioned above, modifications of gene expression 
can be obtained by designing complementary sequences or 45 
antisense molecules (DNA, RNA, or PNA) to the control, 5', 
or regulatory regions of the gene encoding HUCYP. Oligo- 
nucleotides derived from the transcription initiation site, 
e.g., between about positions -10 and +10 from the start site, 
are preferred. Similarly, inhibition can be achieved using 50 
triple helix base-pairing methodology. Triple helix pairing is 
useful because it causes inhibition of the ability of the 
double helix to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. 
Recent therapeutic advances using triplex DNA have been 55 
described in the literature. (See, e.g.. Gee, J. E. et al. (1994) 
in Huber, B. E. and B. I. Carr, Molecular and Immunologic 
Approaches, Futura Publishing, Mt. Kisco N.Y., pp. 
163-177.) A complementary sequence or antisense molecule 
may also be designed to block translation of mRNA by 60 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used 
to catalyze the specific cleavage of RNA. The mechanism of 
ribozyme action involves sequence-specific hybridization of 
the ribozyme molecule to complementary target RNA, fol- 65 
lowed by endonucleo lytic cleavage. For example, engi- 
neered hammerhead motif ribozyme molecules may specifi- 
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cally and efficiently catalyze endonuclcolytic cleavage of 
sequences encoding HUCYP. 

Specific ribozyme cleavage sites within any potential 
RNA target are initially identified by scanning the target 
molecule for ribozyme cleavage sites, including the follow- 
ing sequences: GUA, GUU, and GUC. Once identified, short 
RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the 
cleavage site, may be evaluated for secondary structural 
features which may render the oligonucleotide inoperable. 
The suitability of candidate targets may also be evaluated by 
testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and 
ribozymes of the invention may be prepared by any method 
known in the art for the synthesis of nucleic acid molecules, 
lliese include techniques for chemically synthesizing oli- 
gonucleotides such as solid phase phosphoramidite chemical 
synthesis. Alternatively, RNA molecules may be generated 
by in vitro and in vivo transcription of DNA sequences 
encoding HUCYP. Such DNA sequences may be incorpo- 
rated into a wide variety of vectors with suitable RNA 
polymerase promoters such as T7 or SP6. Alternatively, 
these cDNA constructs that synthesize complementary 
RNA, constitutively or inducibly, can be introduced into cell 
lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular 
stability and half -life. Possible modifications include, but are 
not limited to, the addition of flanking sequences at the 5' 
and/or 3' ends of the molecule, or the use of phosphorothio- 
ate or 2' O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inher- 
ent in the production of PNAs and can be extended in all of 
these molecules by the inclusion of nontraditional bases 
such as inosine, queosine, and wybutosine, as well as 
acetyl-, methyl-, thio-, and similarly modified forms of 
adenine, cytidine, guanine, thymine, and uridine which are 
not as easily recognized by endogenous cndonucleases. 

Many methods for introducing vectors into cells or tissues 
are available and equally suitable for use in vivo, in vitro, 
and ex vivo. For ex vivo therapy, vectors may be introduced 
into stem cells taken from the patient and clonally propa- 
gated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by 
polycationic amino polymers may be achieved using meth- 
ods which are well known in the art. (See, e.g., Goldman, C. 
K. et al. (1997) Nature Biotechnology 15:462-466.) 

Any of the therapeutic methods described above may be 
applied to any subject in need of such therapy, including, for 
example, mammals such as dogs, cats, cows, horses, rabbits, 
monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the 
administration of a pharmaceutical or sterile composition, in 
conjunction with a pharmaceutically acceptable carrier, for 
any of the therapeutic effects discussed above. Such phar- 
maceutical compositions may consist of HUCYP, antibodies 
to HUCYP, and mimetics, agonists, antagonists, or inhibi- 
tors of HUCYP The compositions may be administered 
alone or in combination with at least one other agent, such 
as a stabilizing compound, which may be administered in 
any sterile, biocompatible pharmaceutical carrier including, 
but not limited to, saline, buffered saline, dextrose, and 
water. The compositions may be administered to a patient 
alone, or in combination with other agents, drugs, or hor- 
mones. 

The pharmaceutical compositions utilized in this inven- 
tion may be administered by any number of routes including. 
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but not limited to, oral, intravenous, intramuscular, intra- 
arterial, intramedullary, intrathecal, intraventricular, 
transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical 
compositions may contain suitable pharmaceutically- 
acceptable carriers comprising excipients and auxiliaries 
which facilitate processing of the active compounds into 
preparations which can be used pharmaceutically. Further 
details on techniques for formulation and administration 
may be found in the latest edition ot Remington's Pharma- 
ceutical Sciences (Maack Publishing, Easlon Pa.). 

Pharmaceutical compositions for oral administration can 
be formulated using pharmaceutically acceptable carriers 
well known in the art in dosages suitable for oral adminis- 
tration. Such carriers enable the pharmaceutical composi- 
tions to be formulated as tablets, pills, dragee s, capsules, 
h quids, gels, syrups, slurries, suspensions, and the like, for 
ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained 
through combining active compounds with solid excipient 
and processing the resultant mixture of granules (optionally, 
after grinding) to obtain tablets or dragee cores. Suitable 
auxiliaries can be added, if desired. Suitable excipients 
include carbohydrate or protein fillers, such as sugars, 
including lactose, sucrose, mannitol, and sorbitol; starch 
from corn, wheat, rice, potato, or other plants; cellulose, 
such as methyl cellulose, hydroxypropylmethyl-cellulose, or 
sodium carboxymethylcellulose; gums, including arabic and 
Iragacanth; and proteins, such as gelatin and collagen. If 
desired, disintegrating or solubilizing agents may be added, 
such as the cross-linked polyvinyl pyrrolidone, agar, and 
alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable 
coatings, such as concentrated sugar solutions, which may 
also contain gum arabic, talc, polyvinylpyrrolidone, car- 
bopol gel, polyethylene glycol, and/or titanium dioxide, 
lacquer solutions, and suitable organic solvents or solvent 
mixtures. Dyestuffs or pigments may be added to the tablets 
or dragee coatings for product identification or to charac- 
terize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally 
include push-fit capsules made of gelatin, as well as soft, 
sealed capsules made of gelatin and a coating, such as 
glycerol or sorbitol. Push-fit capsules can contain active 
ingredients mixed with fiUers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium steeirate, and, 
optionally, stabilizers. In soft capsules, the active com- 
pounds may be dissolved or suspended in suitable liquids, 
such as fatty oils, liquid, or liquid polyethylene glycol with 
or without stabilizers. 

Pharmaceutical formulations suitable for parenteral 
administration may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as 
Hanks* solution. Ringer's solution, or physiologically buff- 
ered saline. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, 
such as sodium carboxymethyl cellulose, sorbitol, or dext- 
ran. Additionally, suspensions of the active compounds may 
be prepared as appropriate oily injection suspensions. Suit- 
able lipophilic solvents or vehicles include fatty oils, such as 
sesame oil, or synthetic fatty acid esters, such as ethyl oleate, 
triglycerides, or liposomes. Non-Upid polycationic amino 
polymers may also be used for delivery. Optionally, the 
suspension may also contain suitable stabilizers or agents to 
increase the solubility of the compounds and allow for the 
preparation of highly concentrated solutions. 



For topical or nasal administration, penetrants appropriate 
to the particular barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art. 
The pharmaceutical compositions of the present invention 
5 may be manufactured in a manner that is known in the art, 
e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, 
encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a 
-iQ salt and can be formed with many acids, including but not 
limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, 
malic, and succinic acids. Salts tend to be more soluble in 
aqueous or other pro tonic solvents than are the correspond- 
ing free base forms. In other cases, the preferred preparation 
may be a lyophilized powder which may contain any or all 
of the following: 1 mM to 50 mM histidine, 0.1% to 2% 
sucrose, and 2% to 7% mannitol, at a pH range of 4.5 to 5.5, 
that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, 
they can be placed in an appropriate container and labeled 
for treatment of an indicated condition. For administration 
of HUCYP, such labeling would include amount, frequency, 
and method of administration. 

Pharmaceutical compositions suitable for use in the 
25 invention include compositions wherein the active ingredi- 
ents are contained in an effective amount to achieve the 
intended purpose. The determination of an effective dose is 
well within the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can 
30 be estimated initially either in cell culture assays, e.g., of 
neoplastic cells or in animal models such as mice, rats, 
rabbits, dogs, or pigs. An animal model may also be used to 
determine the appropriate concentration range and route of 
administration. Such information can then be used to deter- 
35 mine useful doses and routes for administration in humans. 
A therapeutically effective dose refers to that amount of 
active ingredient, for example HUCYP or fragments thereof, 
antibodies of HUCYP, and agonists, antagonists or inhibi- 
tors of HUCYP, which ameliorates the symptoms or condi- 
40 tion. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with 
experimental animals, such as by calculating the ED^^ (the 
dose therapeutically effective in 50% of the population) or 
LD5Q (the dose lethal to 50% of the population) statistics. 
45 The dose ratio of toxic to therapeutic effects is the thera- 
peutic index, and it can be expressed as the LD50/ED5Q ratio. 
Pharmaceutical compositions which exhibit large therapeu- 
tic indices are preferred. The data obtained from cell culture 
assays and animal studies are used to formulate a range of 
dosage for human use. The dosage contained in such com- 
positions is preferably within a range of circulating concen- 
trations that includes the ED50 with little or no toxicity. The 
dosage varies within this range depending upon the dosage 
form employed, the sensitivity of the patient, and the route 
of administration. 

The exact dosage will be determined by the practitioner, 
in light of factors related to the subject requiring treatment. 
Dosage and administration are adjusted to provide sufiScient 
levels of the active moiety or to maintain the desired effect. 
60 Factors which may be taken into account include the sever- 
ity of the disease state, the general health of the subject, the 
age, weight, and gender of the subject, time and frequency 
of administration, drug combination(s), reaction 
sensitivities, and response to therapy. I^ng-acting pharma- 
65 ceutical compositions may be administered every 3 to 4 
days, every week, or biweekly depending on the half -life 
and clearance rate of the particular formulation. 
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Normal dosage amounts may vary from about 0.1 /ig to 
100,000 |<g, up to a total dose of about 1 gram, depending 
upon the route of administration. Guidance as to particular 
dosages and methods of delivery is provided in the literature 
and generally available to practitioners in the art. Those 
skilled in the art will employ different formulations for 
nucleotides than for proteins or their inhibitors. Similarly, 
delivery of polynucleotides or polypeptides will be specific 
to particular cells, conditions, locations, etc. 

Diagnostics 

In another embodiment, antibodies which specifically 
bind HUCYP may be used for the diagnosis of disorders 
characterized by expression of HUCYP, or in assays to 
monitor patients being treated with HUCYP or agonists, 
antagonists, or inhibitors of HUCYP. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as 
described above for therapeutics. Diagnostic assays for 
HUCYP include methods which utilize the antibody and a 
label to detect HUCYP in human body fluids or in extracts 
of cells or tissues. ITie antibodies may be used with or 
without modification, and may be labeled by covalent or 
non-covalent attachment of a reporter molecule. A wide 
variety of reporter molecules, several of which are described 
above, are known in the art and may be used. 

A variety of protocols for measuring HUCYP, including 
ELISAs, RIAs, and FACS, are known in the art and provide 
a basis for diagnosing altered or abnormal levels of HUCYP 
expression. Normal or standard values for HUCYP expres- 
sion are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, preferably 
human, with antibody to HUCYP under conditions suitable 
for complex formation. The amount of standard complex 
formation may be quantiiated by various methods, prefer- 
ably by photometric means. Quantities of HUCYP expressed 
in subject samples, control and disease from biopsied tissues 
are compared with the standard values. Deviation between 
standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucle- 
otides encoding HUCYP may be used for diagnostic pur- 
poses. The polynucleotides which may be used include 
oUgonucleotide sequences, complementary RNA and DNA 
molecules, and PNAs. The polynucleotides may be used to 
detect and quantitate gene expression in biopsied tissues in 
which expression of HUCYP may be correlated with dis- 
ease. The diagnostic assay may be used to determine 
absence, presence, and excess expression of HUCYP, and to 
monitor regulation of HUCYP levels during therapeutic 
intervention. 

In one aspect, hybridization with PCR probes which are 
capable of detecting polynucleotide sequences, including 
genomic sequences, encoding HUCYP or closely related 
molecules may be used to identify nucleic acid sequences 
which encode HUCYP. The specificity of the probe, whether 
it is made from a highly specific region, e.g., the 5' regula- 
tory region, or from a less specific region, e.g., a conserved 
motif, and the stringency of the hybridization or amplifica- 
tion (maximal, high, intermediate, or low), will determine 
whether the probe identifies only naturally occurring 
sequences encoding HUCYP, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related 
sequences, and should preferably have at least 50% 
sequence identity to any of the HUCYP encoding sequences. 
The hybridization probes of the subject invention may be 
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DNA or RNA and may be derived from the sequence of SEQ 
ID NO:2 or from genomic sequences including promoters, 
enhancers, and introns of the HUCYP gene. 

Means for producing specific hybridization probes for 
DNAs encoding HUCYP include the cloning of polynucle- 
otide sequences encoding HUCYP or HUCYP derivatives 
into vectors for the production of mRNA probes. Such 
vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of 
the addition of the appropriate RNA polymerases and the 
appropriate labeled nucleotides. Hybridization probes may 
be labeled by a variety of reporter groups, for example, by 
radionuclides such as ^^P or 35S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via 
avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding HUCYP may be used 
for the diagnosis of disorders associated with expression of 
HUCYP. Examples of such disorders include, but are not 
limited to, a cell proliferative disorder such as actinic 
keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), 
myelofibrosis, paroxysmal nocturnal hemoglobinuria, poly- 
cythemia vera, psoriasis, primary throrabocythemia, and 
cancers including adenocarcinoma, leukemia, lymphoma, 
melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone 
marrow, brain, breast, cervix, gall bladder, gangha, gas- 
trointestinal tract, heart, kidney, liver, lung, muscle, ovary, 
pancreas, parathyroid, penis, prostate, salivary glands, skin, 
spleen, testis, thymus, thyroid, and uterus; a developmental 
disorder such as renal tubular acidosis, anemia, Cushing*s 
syndrome, achondroplastic dwarfism, Duchenne and Becker 
muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR 
syndrome (Wilms' tumor, aniridia, genitourinary 
abnormalities, and mental retardation), Smith-Magenis 
syndrome, myelodysplastic syndrome, hereditary mucocpi- 
thelial dysplasia, hereditary keratodermas, hereditary neu- 
ropathies such as Charcot-Marie-Tooth disease and 
neurofibromatosis, hypothyroidism, hydrocephalus, seizure 
disorders such as Syndenham*s chorea and cerebral palsy, 
spina bifida, anencephaly, craniorachischisis, congenital 
glaucoma, cataract, and sensorineural hearing loss; an 
autoimmune/inflammatory disorder such as acquired immu- 
nodeficiency syndrome (AIDS), Addison*s disease, adult 
respiratory distress syndrome, allergies, ankylosing 
spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, 
autoimmune polyenodocrinopathy-candidiasis-ectodermal 
dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes melUtus, emphysema, episodic 
lymphopenia with lymphocytotoxins, erythroblastosis 
fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout. Graves* 
disease, Hashimoto's thyroiditis, hypereosinophilia, irritable 
bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, 
osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's 
syndrome, systemic anaphylaxis, systemic lupus 
erythematosus, systemic sclerosis, thrombocytopenic 
purpura, ulcerative colitis, uveitis, Werner syndrome, com- 
plications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and a metabolic disorder 
such as Addison's disease, cystic fibrosis, diabetes, fatty 
hepatocirrhosis, galactosemia, goiter, hyperadrenalism. 
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hypoadrenalism, hyperparathyroidism, hypoparathyroidism, 
hypercholesterolemia, hyperthyroidism, hypothyroidism 
hyperlipidemia, hyperlipemia, lipid myopathies, obesity, 
lipodystrophies, and phenylketonuria, congenital adrenal 
hyperplasia, pseudovitamin D-deficiency rickets, cerebro- 
tendinous xanthomatosis, and coumarin resistance. The 
polynucleotide sequences encoding HUCYP may be used in 
Southern or northern analysis, dot blot, or other membrane- 
based technologies; in PGR technologies; in dipstick, pin, 
and multiformat ELISA-like assays; and in micro arrays 
utilizing fluids or tissues from patients to detect altered 
HUCYP expression. Such qualitative or quantitative meth- 
ods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding 
HUCYP may be useful in assays that delect the presence of 
associated disorders, particularly those mentioned above. 
The nucleotide sequences encoding HUCYP may be labeled 
by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of 
hybridization complexes. After a suitable incubation period, 
the sample is washed and the signal is quantitated and 
compared with a standard value. If the amount of signal in 
the patient sample is significantly altered in comparison to a 
control sample then the presence of altered levels of nucle- 
otide sequences encoding HUCYP in the sample indicates 
the presence of the associated disorder. Such assays may 
also be used to evaluate the efficacy of a particular thera- 
peutic treatment regimen in animal studies, in clinical trials, 
or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder 
associated with expression of HUCYP, a normal or standard 
profile for expression is established. This may be accom- 
plished by combining body fluids or cell extracts taken from 
normal subjects, either animal or human, with a sequence, or 
a fragment thereof, encoding HUCYP, under conditions 
suitable for hybridization or amplification. Standard hybrid- 
ization may be quantified by comparing the values obtained 
from normal subjects with values from an experiment in 
which a known amount of a substantially purified polynucle- 
otide is used. Standard values obtained in this manner may 
be compared with values obtained from samples from 
patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a 
disorder. 

Once the presence of a disorder is established and a 
treatment protocol is initiated, hybridization assays may be 
repeated on a regular basis to determine if the level of 
expression in the patient begins to approximate that which is 
observed in the normal subject. The results obtained from 
successive assays may be used to show the eflHcacy of 
treatment over a period ranging from several days to months. 

With respect to cancer, the presence of an abnormal 
amount of transcript (either under- or over-expressed) in 
biopsied tissue firom an individual may indicate a predispo- 
sition for the development of the disease, or may provide a 
means for delecting the disease prior to the appearance of 
actual clinical symptoms. A more definitive diagnosis of this 
type may allow health professionals to employ preventative 
measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed 
from the sequences encoding HUCYP may involve the use 
of PGR. These oligomers may be chemically synthesized, 
generated enzymatically, or produced in vitro. Oligomers 
wiU preferably contain a fragment of a polynucleotide 
encoding HUCYP, or a fragment of a polynucleotide 
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complementary to the polynucleotide encoding HUCYP, and 
will be employed under optimized conditions for identifi- 
cation of a specific gene or condition. Oligomers may also 
be employed under less stringent conditions for detection or 

5 quantitation of closely related DNA or RNA sequences. 

Methods which may also be used to quantitate the expres- 
sion of HUCYP include radiolabeling or biotinylating 
nucleotides, co amplification of a control nucleic acid, and 
interpolating results from standard curves. (See, e.g., Melby, 

10 P. C. et al. (1993) J. Immunol. Methods 159:235-244; 
Duplaa, C. et al. (1993) Anal. Biochem. 229-236.) The 
speed of quantitation of multiple samples may be acceler- 
ated by running the assay in an ELISA format where the 
oligomer of interest is presented in various dilutions and a 

15 spectrophotometric or colorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer frag- 
ments derived from any of the polynucleotide sequences 
described herein may be used as targets in a micro array. The 
microarray can be used to monitor the expression level of 
large numbers of genes simultaneoxisly and to identify 
genetic variants, mutations, and polymorphisms. This infor- 
mation may be used to determine gene function, to under- 
stand the genetic basis of a disorder, to diagnose a disorder, 
and to develop and monitor the activities of therapeutic 
agents. 

Microarrays may be prepared, used, and analyzed using 
methods known in the art. (See, e.g., Brennan, X M. et al. 
(1995) U.S. Pat. No. 5,474,796; Schena, M. et al. (1996) 
Proc. Natl. Acad. Sci. 93:10614-10619; Baldeschweiler et 
al. (1995) PCT application W095/251116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R. A. et al. 
(1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, M. 
J. et al. (1997) U.S. Pat. No. 5,605,662.) 

In another embodiment of the invention, nucleic acid 
sequences encoding HUCYP may be used to generate 
hybridization probes useful in mapping the naturally occur- 
ring genomic sequence. The sequences may be mapped to a 
4Q particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., 
human artificial chromosomes (HACs), yeast artificial chro- 
mosomes (YACs), bacterial artificial chromosomes (BACs), 
bacterial PI constructions, or single chromosome cDNA 
45 libraries. (See, e.g., Harrington, J. J. et al. (1997) Nat Genet. 
15:345-355; Price, C. M. (1993) Blood Rev. 7:127-134; and 
Trask, B. J. (1991) Trends Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be corre- 
lated with other physical chromosome mapping techniques 
50 and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) 
in Meyers, supra, pp. 965-968.) Examples of genetic map 
data can be found in various scientific journals or at the 
Online Mendelian Inheritance in Man (OMIM) site. Corre- 
lation between the location of the gene encoding HUCYP on 
55 a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder. The nucleotide 
sequences of the invention may be used to detect differences 
in gene sequences among normal, carrier, and affected 
60 individuals. 

In situ hybridization of chromosomal preparations and 
physical mapping techniques, such as linkage analysis using 
established chromosomal markers, may be used for extend- 
ing genetic maps. Often the placement of a gene on the 
65 chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the number or arm of 
a particular human chromosome is not known. New 
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sequences can be assigned to chromosomal arms by physical 
mapping, lliis provides valuable information to investiga- 
tors searching for disease genes using positional cloning or 
other gene discovery techniques. Once the disease or syn- 
drome has been crudely localized by genetic linkage to a 
particular genomic region, e.g., ataxia-telangiectasia to 
llq22-23, any sequences mapping to that area may repre- 
sent associated or regulatory genes for further investigation. 
(See, e.g., Gatti, R. A. et al. (1988) Nature 336:577-580.) 



Instruments, Fullerton Calif.) for 18 hours at 25,000 rpm at 
ambient temperature. The RNA was extracted with acid 
phenol pH 4.0, precipitated using 0.3 M sodium acetate and 
2.5 volumes of ethanol, resuspended in RNAse-free water 
and treated with DNase at 37° C. The RNA extraction and 
precipitation were repeated as before. The mRNA was 
isolated using an OLIGOTEX kit (QIAGEN, Valencia 
Calif.) and used to construct the cDNA library. 

The mRNA was handled according to the recommended 



(See, e.g., Gatti, R. A. et al. (^ivoe; iNaiurc j^^u.j / /-^ow.^ ^ne mi^i^^ wa:> uau^xv^ ^^..^xw^, — 

The nucleotide sequence of the subject invention may also lO protocols in the SUPERSCRIPT Pl^^^^ ^y^'^™ ^J^jf^ 
be used to detect differences in the chromosomal location Technologies). cDNAs were fractionated on a SEPHARUJ>t, 
due to translocation, inversion, etc., among normal, carrier, CL4B column (Amersham Pharmacia Biotech), and those 
or affected individuals. cDNAs exceeding 400 bp were hgated into pSPORTl 

In another embodiment of the invention, HUCYP, its plasmid (Life technologies^ ^ 

catalytic or immunogenic fragments, or oligopeptides 15 subsequently transformed into DH5a competent cells (Lite 



thereof can be used for screening libraries of compounds in 
any of a variety of drug screening techniques. The fragment 
employed in such screening may be free in solution, afiSxed 
to a solid support, borne on a cell surface, or located 
intracellularly. The formation of binding complexes between 
HUCYP and the agent being tested may be measured. 

Another technique for drug screening provides for high 
throughput screening of compounds having suitable binding 
affinity to the protein of interest. (See, e.g., Geysen, et al. 
(1984) PCT application WO84/03564.) In this method, large 
numbers of different small test compounds are synthesized 
on a solid substrate. The test compounds are reacted with 
HUCYP, or fragments thereof, and washed. Bound HUCYP 
is then detected by methods well known in the art. Purified 
HUCYP can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug 
screening assays in which neutralizing antibodies capable of 
binding HUCYP specifically compete with a test compound 
for binding HUCYP. In this manner, antibodies can be used 
to detect the presence of any peptide which shares one or 
more antigenic determinants with HUCYP. 

In additional embodiments, the nucleotide sequences 
which encode HUCYP may be used in any molecular 
biology techniques that have yet to be developed, provided 
the new techniques rely on properties of nucleotide 
sequences that ' ' '"^ ^ * 



Technologies). 

IL Isolation of cDNA Clones 

Plasmid DNA was released from the cells and purified 
using the R.E.A.L. PREP 96 plasmid kit from QIAGEN. 
This kit enables the simultaneous purification of 96 samples 
in a 96-well block using multi-channel reagent dispensers. 
The recommended protocol was employed except for the 
following changes: 1) the bacteria were cultured in 1 ml of 
sterile Terrific Broth (Life Technologies) with carbenicillin 
at 25 mg/L and glycerol at 0.4%; 2) after inoculation, the 
cultures were incubated for 19 hours and at the end of 
incubation, the cells were lysed with 0.3 ml of lysis buffer; 
and 3) following isopropanol precipitation, the plasmid 
DNA pellet was resuspended in 0.1 ml of distilled water. 
After the last step in the protocol, samples were transferred 
to a 96-well block for storage at 4** C. 

III. Sequencing and Analysis 

The cDNAs were prepared for sequencing using the ABI 
CATALYST 800 (Perkin-Elmer) or the HYDRA microdis- 
penser (Robbins Scientific) or MICROLAB 2200 
(Hamilton) systems in combination with PTC-200 thermal 
40 cyclers (MJ Research). The cDNAs were sequenced using 
the ABI PRISM 373 or 377 sequencing systems (Perkin- 
Elmer) and standard ABI protocols, base caUing software, 
and kits. In one alternative, cDNAs were sequenced using 
the MEGABACE 1000 DNAsequencing system (Molecular 
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fled and sequenced using the ABI PRISM BIGDYE Termi- 
nator cycle sequencing ready reaction kit (Perkin-Elmer), In 
yet another alternative, cDNAs were sequenced using solu- 
tions and dyes from Amersham Pharmacia Biotech. Reading 
50 frames for the ESTs were determined using standard meth- 
ods (reviewed in Ausubel, 1997, supra, unit 7.7). Some of 
the cDNA sequences were selected for extension using the 
techniques disclosed in Example V. 

The polynucleotide sequences derived from cDNA, 
55 extension, and shotgun sequencing were assembled and 
analyzed using a combination of software programs which 
utilize algorithms weU known to those skilled in the art. 
Table 1 summarizes the tools, programs, and algorithms 
used and provides descriptions, references, and threshold 



Hmited to, such properties as the triplet genetic code and 
specific base pair interactions. 

The examples below are provided to illustrate the subject 
invention and are not included for the purpose of limiting the 
invention. 

EXAMPLES 
I. cDNA Library Construction 

The COLNNOTll cDNA library was constructed from 
microscopically normal colon tissue obtained from a 
60-year-old Caucasian male who had undergone a left 
hemicolectomy to remove grade 3 (of 4) adenocarcinoma in 

a different part of his bowel. The patient history reported useu auu pi^vx^v-^ .-^ _ _ 

previous diagnoses of depressive disorder and 60 parameters when appUcable. The first column ot lable 1 

Thrombophlebitis, accompanied by inflammatory pol- shows the tools, programs, and algorithms used, the second 

varthropathies and inflammatory disease of the prostate. column provides brief descriptions the third column pre- 

-nie frozen tissue was homogenized and lysed using a sents the appropriate references, all of which ary^<^n>o^ 

Pob^ror^-3000 homogenizer^Brinkmann Instruments, rated by reference herein in their -"^^f ^^^^^^^^^ 
Westbury N J) in guanidinium isothiocyanate solution. The 65 column presents, where applicable, the scores, P^oJ^^^^ 

ErwL centrifuged over a 5.7 M CsCl cushion using a values, and other parameters used to evaluate the strength of 

SW28 rotoT in a L8-70M ultracentrifuge (Beckman a match between two sequences (the higher the score, the 
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greater the homology between two sequences). Sequences 
were analyzed using MACDNASIS PRO software (Hitachi 
Software Engineering) and LASERGENE software 
(DNASTAR). 

The polynucleotide sequences were validated by remov- 5 
ing vector, linker, and polyA sequences and by masking 
ambiguous bases, using algorithms and programs based on 
BLAST, dynamic programming, and dinucleotide nearest 
neighbor analysis, the sequences were then queried against 
a selection of public databases, such as the GenBank 
primate, rodent, mammalian, vertebrate, and eukaryote 
databases, and BLOCKS to acquire annotation, using pro- 
grams based on BLAST, FASTA, and BLIMPS. The 
sequences were assembled into full length polynucleotide 
sequences using programs based on Phred, Phrap, and 15 
Consed, and were screened for open reading frames using 
programs based on GeneMark, BLAST, and FASTA. The 
full length polynucleotide sequences were translated to 
derive the corresponding full length amino acid sequences, 
and these full length sequences were subsequently analyzed 2G 
by querying against databases such as the GenBank 
databases, SwissProt, BLOCKS, PRINTS, PFAM, and Pros- 
ite. 

The programs described above for the assembly and 
analysis of full length polynucleotide and amino acid 
sequences were used to identify polynucleotide sequence 
fragments from SEQ ID NO:2. 

Fragments from about 20 to about 4000 nucleotides which 
are useful in hybridization and ampUfication technologies 
were described in the Invention section above. 



logic. The disease categories included cancer, inflammation/ 
trauma, fetal, neurological, and pooled. For each category, 
the number of libraries expressing the sequence of interest 
was counted and divided by the total number of libraries 
across all categories. Percentage values of tissue-specific 
and disease expression are reported in the description of the 
invention. 

V. Extension of HUCYP Encoding Polynucleotides 



IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect 
the presence of a transcript of a gene and involves the 
hybridization of a labeled nucleotide sequence to a mem- 
brane on which RNAs from a particular cell type or tissue 
have been bound. (See, e.g., Sambrook, supra, ch. 7; 
Ausubel, 1995, supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were 
used to search for identical or related molecules in nucle- 
otide databases such as the GenBank or LIFESEQ databases 
(Incyte Pharmaceuticals, Palo Alto Calif.). This analysis is 
much faster than multiple membrane-based hybridizations. 
In addition, the sensitivity of the computer search can be 
modified to determine whether any particular match is 
categorized as exact or similar. The basis of the search is the 
product score, which is defined as: 

% sequence identity x % nnaxin ium BLAST score 
100 

The product score takes into account both the degree of 
similarity between two sequences and the length of the 
sequence match. For example, with a product score of 40, 
the match will be exact within a 1% to 2% error, and, with 
a product score of 70, the match wUl be exact. Similar 
molecules are usually identified by selecting those which 
show product scores between 15 and 40, although lower 
scores may identify related molecules. 

The results of northern analyses reported a percentage 
distribution of libraries in which the transcript encoding 
HUCYP occurred. Analysis involved the categorization of 
cDN A libraries by organ/tissue and disease. The organ/tissue 
categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/ 
immune, musculoskeletal, nervous, reproductive, and uro- 



The full length nucleic acid sequence of SEQ ID NO:2 
was produced by extension of an appropriate fragment of the 
full length molecule using oligonucleotide primers designed 
from this fragment. One primer was synthesized to initiate 
5' extension of the known fragment, and the other printer, to 
initiate 3' extension of the known fragment. The initial 
primers were designed using OLIGO 4.06 software 
(National Biosciences), or another appropriate program, to 
be about 22 to 30 nucleotides in length, to have a GC content 
of about 50% or more, and to anneal to the target sequence 
at temperatures of about 68** C. to about 72° C. Any stretch 
of nucleotides which would resuh in hairpin structures and 
primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the 
sequence. If more than one extension was necessary or 
desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using 
methods well known in the art. PCR was performed in 
96-well plates using the PTC-200 thermal cycler (MJ 
3Q Research, Inc.). The reaction mix contained DNA template, 
200 nmol of each primer, reaction buffer containing Mg^*, 
(NH4)2S04, and p-mercaptoethanol, Taq DNA polymerase 
(Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), 
with the following parameters for primer pair PCI A and PCI 
B: Step 1: 94° C, 3 min; Step 2: 94° C, 15 sec; Step 3: 60° 
C, 1 min; Step 4: 68° C, 2 min; Step 5: Steps 2, 3, and 4 
repeated 20 times; Step 6: 68° C, 5 min; Step 7: storage at 
4° C. In the alternative, the parameters for primer pair 1^ 
and SK+ were as follows: Step 1: 94° C, 3 min; Step 2: 94° 
C, 15 sec; Step 3: 57° C, 1 min; Step 4: 68° C, 2 min; Step 
5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68° C, 5 min; 
Step 7: storage at 4° C. 

The concentration of DNA in each well was determined 
45 by dispensing 100 jul PICO GREEN quantitation reagent 
(0.25% (v/v) PICO GREEN; Molecular Probes, Eugene 
Org.) dissolved in IX TE and 0.5 of undiluted PCR 
product into each well of an opaque fluorimeter plate 
(Corning Costar, Acton Mass.), allowing the DNA to bind to 
the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluores- 
cence of the sample and to quantify the concentration of 
DNA. A 5 ^1 to 10 /il aliquot of the reaction mixture was 
analyzed by electrophoresis on a 1% agarose mini-gel to 
determine which reactions were successful in extending the 
sequence. 

The extended nucleotides were desalted and concentrated, 
transferred to 384-well plates, digested with CviJI cholera 
virus endonuclease (Molecular Biology Research, Madison 
60 Wis.), and sonicated or sheared prior to religation into pUC 
18 vector (Amersham Pharmacia Biotech). For shotgun 
sequencing, the digested nucleotides were separated on low 
concentration (0.6 to 0.8%) agarose gels, fragments were 
excised, and agar digested with Agar ACE (Promega). 
65 Extended clones were religated using T4 ligase (New 
England Biolabs, Beveriy Mass.) into pUC 18 vector 
(Amersham Pharmacia Biotech), treated with Pfu DNA 
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polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed 
cells were selected on antibiotic -containing media, indi- 
vidual colonies were picked and cultured overnight at 37* C. 
in 384-weU plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR 
using Taq DNA polymerase (Amersham Pharmacia Biotech) 
and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94** C, 3 min; Step 2: 94** C, 15 sec; 
Step 3: 60° C, I min; Step 4: 72** C, 2 min; Step 5: step 2, 
3, and 4 repeated 29 times; Step 6: 72** C, 5 min; Step 7: 
storage at 4° C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples 
with low DNA recoveries were reamplified using the same 



fluorescence. The degree of complementarity and the rela- 
tive abundance of each probe which hybridizes to an ele- 
ment on the microarray may be assessed through analysis of 
the scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or 
fragments thereof may comprise the elements of the 
microarray. Fragments suitable for hybridization can be 
selected using software well known in the art such as 
LASERGENE software (DNASTAR), FuU-length cDNAs, 
ESTs, or fragments thereof corresponding to one of the 
nucleotide sequences of the present invention, or selected at 
random from a cDNA library relevant to the present 
invention, are arranged on an appropriate substrate, e.g., a 
glass slide. The cDNA is fixed to the slide using, e.g., UV 



conditions as described above. Samples were diluted with cross-linking followed by thermal and chemical treatments 



20% dimethysulphoxide (1:2, v/v), and sequenced using 
D YEN AMI C energy transfer sequencing primers and the 
DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) 
or the ABI PRISM BIGDYE Terminator cycle sequencing 
ready reaction kit (Perkin- Elmer). 

In like manner, the nucleotide sequence of SEQ ID NO: 2 
is used to obtain 5* regulatory sequences using the procedure 
above, oligonucleotides designed for such extension, and an 
appropriate genomic library. 

VI. Labeling and Use of Individual Hybridization 
Probes 

Hybridization probes derived from SEQ ID NO:2 are 
employed to screen cDNAs, genomic DNAs, or mRNAs. 
Although the labeling of oligonucleotides, consisting of 
about 20 base pairs, is specifically described, essentially the 
same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art soft- 
ware such as OLIGO 4.06 software (National Biosciences) 
and labeled by combining 50 pmol of each oligomer, 250 
//Ci of p^P]-adenosine triphosphate (Amersham Pharmacia 
Biotech), and T4 polynucleotide kinase (DuPont NEN, 
Boston Mass.). The labeled oligonucleotides are substan- 
tially purified using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia 
Biotech). An aliquot containing 10'' counts per minute of the 
labeled probe is used in a typical membrane-based hybrid- 
ization analysis of human genomic DNA digested with one 
of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, 
Xbal, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% 
agarose gel and transferred to nylon membranes (Nytran 
Plus, Schleicher & Schuell, Durham N.H.). Hybridization is 
carried out for 16 hours at 40° C. To remove nonspecific 
signals, blots are sequentially washed at room temperature 
under increasingly stringent conditions up to O.lxsaline 
sodium citrate and 0.5% sodium dodecyl sulfate. Hybrid- 
ization patterns are visualized and compared using autora- 
diography. 

VII. Microarray s 

A chemical coupling procedure and an ink jet device can 
be used to synthesize array elements on the surface of a 
substrate. (See, e.g., Baldeschweiler, supra.) An array analo- 
gous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, 
chemical, or mechanical bonding procedures. A typical array 
may be produced by hand or using available methods and 
machines and contain any appropriate number of elements. 
After hybridization, nonhybridized probes are removed and 
a scanner used to determine the levels and patterns of 
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and subsequent drying. (See, e.g., Schena, M. et al. (1995) 
Science 270:467^70; Shalon, D. et al. (1996) Genome Res. 
6:639-645.) Fluorescent probes are prepared and used for 
hybridization to the elements on the substrate. The substrate 
20 is analyzed by procedures described above. 

VIII. Complementary Polynucleotides 

Sequences complementary to the HUCYP-encoding 
25 sequences, or any parts thereof, are used to detect, decrease, 
or inhibit expression of naturally occurring HUCYP. 
Although use of oligonucleotides comprising from about 15 
to 30 base pairs is described, essentially the same procedure 
is used with smaller or with larger sequence fragments. 
3Q Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding 
sequence of HUCYP. To inhibit transcription, a complemen- 
tary oligonucleotide is designed from the most unique 5' 
sequence and used to prevent promoter binding to the coding 
sequence. To inhibit translation, a complementary oligo- 
nucleotide is designed to prevent ribosomal binding to the 
HUCYP-encoding transcript. 

IX. Expression of HUCYP 

Expression and purification of HUCYP are achieved 
using bacterial or virus-based expression systems. For 
expression of HUCYP in bacteria, cDNA is subcloned into 
an appropriate vector containing an antibiotic resistance 
gene and an inducible promoter that directs high levels of 
cDNA transcription. Examples of such promoters include, 
but are not limited to, the trp-lac (tac) hybrid promoter and 
the T5 or T7 bacteriophage promoter in conjunction with the 
lac operator regulatory element. Recombinant vectors are 
transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express HUCYP upon induction 
with isopropyl beta-D-thiogalactopyranoside (IPTG). 
Expression of HUCYP in eukaryotic cells is achieved by 
infecting insect or mammalian cell lines with recombinant 
Autographica calif ornica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculo virus. The nones- 
sential polyhedrin gene of baculovirus is replaced with 
cDNA encoding HUCYP by either homologous recombina- 
tion or bacterial-mediated transposition involving transfer 
plasmid intermediates. Viral infectivity is maintained and 
the strong polyhedrin promoter drives high levels of cDNA 
transcription. Recombinant baculovirus is used to infect 
Spodoptera frugiperda (Sf9) insect cells in most cases, or 
human hepatocytes, in some cases. Infection of the latter 
requires additional genetic modifications to baculovirus. 
(See Engelhard. E. K. et al. (1994) Proc. Natl. Acad. Sci, 
USA 91:3224-25 3227; Sandig, V. et al. (1996) Hum. Gene 
Ther. 7:1937-1945.) 
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In most expression systems, HUCYP is synthesized as a 
fusion protein with, e.g., glutathione S-transferase (GST) or 
a peptide epitope tag, such as FLAG or 6-His, permitting 
rapid, single-step, afi&nity-based purification of recombinant 
fusion protein from crude cell lysates. GST, a 26-ldlodalton 
enzyme from Schistosoma japonicum, enables the purifica- 
tion of fusion proteins on immobilized glutathione under 
conditions that maintain protein activity and antigenicity 
(Amersham Pharmacia Biotech). Following purification, the 



Flow cytometry (FCM), an automated, laser optics-based 
technique, is used to identify transfected cells expressing 
GFP or CD64-GFP, and to evaluate cellular properties, for 
example, their apoptotic state, FCM detects and quantifies 
the uptake of fluorescent molecules that diagnose events 
preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by 
staining of DNA with propidium iodide; changes in cell size 
and granularity as measured by forward light scatter and 90 



(Amersham Fnarmacia iiioiecn;. rouowiiiij puiiuv-dinju, ^ ^ — - 

GST moiety can be proteolytically cleaved from HUCYP at lo degree side light scatter; down -regulation of DNA synthesis 
. 1 r-T Az-i o ,* n-.^Qciir/n.ri U\i rlRY'rpa<ie in bmmodeox vundiue uotake: 



specifically engineered sites. FLAG, an 8-amino acid 
peptide, enables immunoafifinity purification using commer- 
cially available monoclonal and polyclonal anti-FLAG anti- 
bodies (Eastman Kodak). 6-His, a stretch of six consecutive 
histidine residues, enables purification on metal-chelate res- 
ins (QIAGEN). Methods for protein expression and purifi- 
cation are discussed in Ausubel (1995, supra, ch 10 and 16). 
Purified HUCYP obtained by these methods can be used 
directly in the following activity assay. 

X. Demonstration of HUCYP Activity 

HUCYP activity is demonstrated as the ability to 
hydroxylate the steroid testosterone (Waxman, D. J. (1991) 
Methods Enzymol. 206:462-476). 4-^'^C-testosterone is 



as measured by decrease in bromodeoxyuridine uptake; 
alterations in expression of cell surface and intracellular 
proteins as measured by reactivity with specific antibodies; 
and alterations in plasma membrane composition as mea- 
15 siu-ed by the binding of fluoresce in-conjugated Annexin V 
protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M. G. (1994) Flow Cytometry, 
Oxford, New York N.Y. 

The influence of HUCYP on gene expression can be 
20 assessed using highly purified populations of cells trans- 
fected with sequences encoding HUCYP and either CD64 or 
CD64-GFP. CD64 and CD64-GFP are expressed on the 
surface of transfected cells and bind to conserved regions of 
human immunoglobulin G (IgG). Transfected cells are effi- 



Methods Enzymol 206:462-476). 4- C-testosterone is ^^^^^1 ^ ^^d from nontransfected cells using magnetic 
diluted with unlabeled testosterone dissolved in toluene to ^-.u -.u^, human l^G or antibody against 



give a working solution of 5-7 mCi/mmol. Ten nanomoles 
of the working solution of 4-^'^C-lestosterone (final assay 
concentration 50 //M) are aliquoted to a 13x100 mm assay 
tube, and the solvent is evaporated under N2. HUCYP, 0.1 M 
HEPES buffer, pH 7,4, and 0.1 mM EDTA are added to the 
tube, on ice, to give a final volume of 175 jul A control tube 
is prepared that lacks HUCYP. Samples are vortexed briefly, 
and the tubes are transferred to a shaking 37° C. water bath. 
Reactions are initiated 4 minutes later by the addition of 
NADPH dissolved in 25 /il 0.1 M HEPES buffer, pH 7.4 to 
give a final NADPH concentration of 0.3 mM. Reactions are 
terminated after 10-20 minutes by the addition of 1 ml ethyl 
acetate and vortexed for 30 seconds. The hydroxylated 
products are separated from testosterone by thin layer chro- 
matography and detected by autoradiography. Product 
migrations are compared with those of testosterone and of 
hydroxylated steroid standards to identify the products. 
Reaction products are cut out of the thin layer chromatog- 
raphy plate and counted in a liquid scintillation counter. The 
amount of hydroxylated steroid products found is propor- 
tional to HUCYP activity. 

XI. Functional Assays 
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beads coated with either human IgG or antibody against 
CD64 (DYNAL, Lake Success N.Y). mRNAcan be purified 
from the cells using methods well known by those of skill in 
the art. Expression of mRNA encoding HUCYP and other 
genes of interest can be analyzed by northern analysis or 
microarray techniques. 

XII. Production of HUCYP Specific Antibodies 

HUCYP substanliaUy purified using polyacrylamide gel 
electrophoresis (PAGE; see, e.g., Harrington, M. G. (1990) 
Methods Enzymol. 182:488-495), or other purification 
techniques, is used to immunize rabbits and to produce 
antibodies using standard protocols. 

Alternatively, the HUCYP amino acid sequence is ana- 
lyzed using LASERGENE software (DNASTAR) to deter- 
mine regions of high immunogenicity, and a corresponding 
oligopeptide is synthesized and used to raise antibodies by 
means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the 
C-terminus or in hydrophilic regions are well described in 
the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) 

Typically, oligopeptides 15 residues in length are synthe- 
sized using an ABI 431A Peptide Synthesizer (Perkin- 



HUCYP function is assessed by express^ing the sequences 50 T^le" f^^of'^n 
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encoding HUCYP at physiologically elevated levels in 
mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter 
that drives high levels of cDNA expression. Vectors of 
choice include pCMV SPORT (Life Technologies) and 
pCR3.1 (Invitrogen, Carlsbad Calif.), both of which contain 
the cytomegalovirus promoter. 5-10 ^g of recombinant 
vector are transiently transfected into a human cell line, 
preferably of endothelial or hematopoietic origin, using 
either liposome formulations or electropo ration. 1-2 ^g of 
an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker 
protein provides a means to distinguish transfected cells 

from nontransfected cells and is a reliable predictor of ..^..j j ^ . - ^ 

cDNA expression fi-om the recombinant vector. Marker 65 antibodies specific for HUCYP. An immunoaffimty column 
proteins of choice include, e.g.. Green Buorescent Protein is constructed by covalently coupling anti-HUCYP antibody 
(GFP- Clontech), CD64, or a CD64-GFP fusion protein. to an activated chromatographic resin, such as CNBr- 



Aldrich, St. Louis Mo.) by reaction with 
N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) 
to increase immunogenicity. (See, e.g., Ausubel, 1995, 
supra.) Rabbits are immunized with the oligopeptide-KLH 
complex in complete Freund's adjuvant. Resulting antisera 
are tested for antipeptide activity by, for example, binding 
the peptide to plastic, blocking with 1% BSA, reacting with 
rabbit antisera, washing, and reacting with radio-iodinated 
goat anti-rabbit IgG. 

XIIL Purification of NaturaUy Occurring HUCYP 
Using Specific Antibodies 

Naturally occurring or recombinant HUCYP is substan- 
tially purified by immunoafifinity chromatography using 
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activated SEPHAROSE (Amersham Pharmacia Biotech). 
After the coupling, the resin is blocked and washed accord- 
,ing to the manufacturer's instructions. 

Media containing HUCYP are passed over the immunoaf- 
finity column, and the column is washed under conditions 
that allow the preferential absorbance of HUCYP (e.g., high 
ionic strength buffers in the presence of detergent). The 
column is eluted under conditions that disrupt antibody/ 
HUCYP binding (e.g., a buffer of pH 2 to pH 3, or a high 
concentration of a chaotrope, such as urea or thiocyanate 
ion), and HUCYP is collected. 

XIV. Identification of Molecules Which Interact 
with HUCYP 

HUCYP, or biologically active fragments thereof, are 
labeled with ^^I Bolton-Hunter reagent. (See, e.g., Bolton et 
al. (1973) Biochem. J. 133:529-539.) Candidate molecules 
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previoiLsly arrayed in the wells of a multi-weii plate are 
incubated with the labeled HUCYP, washed, and any wells 
with labeled HUCYP complex are assayed. Data obtained 
using different concentrations of HUCYP are used to cal- 
culate values for the number, afSnity, and association of 
HUCYP with the candidate molecules. Various modifica- 
tions and variations of the described methods and systems of 
the invention will be apparent to those skilled in the art 
without departing from the scope and spirit of the invention. 
Although the invention has been described in connection 
with specific preferred embodiments, it should be under- 
stood that the invention as claimed should not be unduly 
limited to such specific embodiments. Indeed, various modi- 
fications of the described modes for carrying out the inven- 
tion which are obvious to those skilled in molecular biology 
or related fields arc intended to be within the scope of the 
following claims. 



SEQUENCE LISTING 



<160> NUMBER OF SEQ ID NOS : 2 

<210> SEQ ID NO 1 

<211> LENGTH; 4 62 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<2 20> FEATURE: - 

<2 23> OTHER INFORMATION: 9 917 29 
<4 00> SEQUENCE: 1 

Met Leu Asp Phe Ala lie Phe Ala Val Thr Phe Leu Leu Ala Leu 

15 10 15 

Val Gly Ala Val Leu Tyr Leu Tyr Pro Ala Ser Arg Gin Ala Ala 

20 25 30 

Gly He Pro Gly He Thr Pro Thr Glu Glu Lys Asp Gly Asn Leu 

35 40 45 

Pro Asp He Val Asn Ser Gly Ser Leu His Glu Phe Leu Val Asn 

50 55 60 

Leu His Glu Arg Tyr Gly Pro Val Val Ser Phe Trp Phe Gly Arg 

65 70 75 

Arg Leu Val Val Ser Leu Gly Thr Val Asp Val Leu Lys Gin His 

80 85 90 

He Asn Pro Asn Lys Thr Leu Asp Pro Phe Glu Thr Met Leu Lys 

95 100 105 

Ser Leu Leu Arg Tyr Gin Ser Gly Gly Gly Ser Val Ser Glu Asn 

110 115 120 

His Met Arg Lys Lys Leu Tyr Glu Asn Gly Val Thr Asp Ser Leu 

125 130 135 

Lys Ser Asn Phe Ala Leu Leu Leu Lys Leu Ser Glu Glu Leu Leu 

140 145 150 

Asp Lys Trp Leu Ser Tyr Pro Glu Thr Gin His Val Pro Leu Ser 

155 160 165 

Gin His Met Leu Gly Phe Ala Met Lys Ser Val Thr Gin Met Val 

170 175 180 

Met Gly Ser Thr Phe Glu Asp Asp Gin Glu Val He Arg Phe Gin 

185 190 195 



Lys Asn His Gly Thr Val Trp Ser Glu He Gly Lys Gly Phe Leu 
200 205 210 



6,130,077 

41 42 



-continued 



Asp Gly Ser Leu Asp Lys Asn Met Thr Arg Lye Lys Gin Tyr Glu 
2X5 220 225 

Asp Ala Leu Met Gin Leu Glu Ser Val Leu Arg Asn lie lie Lys 
230 235 240 

Glu Arg Lys Gly Arg Asn Phe Ser Gin His lie Phe He Asp Ser 
245 250 255 

Leu Val Gin Gly Asn Leu Asn Asp Gin Gin He Leu Glu Asp Ser 
260 265 270 

Met He Phe Ser Leu Ala Ser Cys He He Thr Ala Lys Leu Cye 
275 280 285 

Thr Trp Ala He Cys Phe Leu Thr Thr Ser Glu Glu Val Gin Lys 
290 295 300 

Lys Leu Tyr Glu Glu He Asn Gin Val Phe Gly Asn Gly Pro Val 
305 * 310 315 

Thr Pro Glu Lys He Glu Gin Leu Arg Tyr Cys Gin His Val Leu 
320 325 330 

Cys Glu Thr Val Arg Thr Ala Lys Leu Thr Pro Val Ser Ala Gin 
335 340 345 

Leu Gin Asp He Glu Gly Lys He Asp Arg Phe He He Pro Arg 
350 355 360 

Glu Thr Leu Val Leu Tyr Ala Leu Gly Val Val Leu Gin Asp Pro 
365 370 375 

Asn Thr Trp Pro Ser Pro His Lys Phe Asp Pro Asp Arg Phe Asp 
380 385 390 

Asp Glu Leu Val Met Lys Thr Phe Ser Ser Leu Gly Phe Ser Gly 
395 400 405 

Thr Gin Glu Cys Pro Glu Leu Arg Phe Ala Tyr Met Val Thr Thr 
410 415 420 

Val Leu Leu Ser Val Leu Val Lys Arg Leu His Leu Leu Ser Val 
425 430 435 

Glu Gly Gin Val He Glu Thr Lys Tyr Glu Leu Val Thr Ser Ser 
440 445 450 

Arg Glu Glu Ala Trp He Thr Val Ser Lys Arg Tyr 
455 460 

<210> SEQ ID NO 2 

<211> LENGTH: 1648 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<220> FEATURE: - 

<2 23> OTHER INFORMATION: 99172 9 
<4 00> SEQUENCE; 2 

gccgatccga gacgtggctc cctgggcggc agaaccatgt tggacttcgc gatcttcgcc 60 
gttaccttct tgctggcgtt ggtgggagcc gtgctctacc tctatccggc ttccagacaa 120 
gctgcaggaa ttccagggat tactccaact gaagaaaaag atggtaatct tccagatatt 18 0 



gtgaatagtg gaagtttgca tgagttcctg gttaatttgc atgagagata tgggcctgtg 24 0 



gtctccttct ggtttggcag gcgcctcgtg gttagtttgg gcactgttga tgtactgaag 
cagcatatca atcccaataa gacattggac ccttttgaaa ccatgctgaa gtcattatta 



300 
360 



aggtatcaat ctggtggtgg cagtgtgagt gaaaaccaca tgaggaaaaa attgtatgaa 42 0 



aatggtgtga ctgattctct gaagagtaac tttgccctcc tcctaaagct ttcagaagaa 
ttattagata aatggctctc ctacccagag acccagcacg tgcccctcag ccagcatatg 



480 
540 
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-continued 





ctatgaagtc 


"tgttacacag 


atggtaatgg 


gtagtac att 


tgaagatgat 


600 




ttc gcii'tcca 


g aa gaal:. c at 


ggcacagttt; 


ggtctgagat 


tggaaaaggc 


660 


'k't c a g a't 9 


ggtcacttga 


'baaaaaca'tg 


actcggaaaa 


aacaatatga 


agatgccctc 


720 


Q'tgcQQC'tgg 


agtctgtttt 


a aggaa cat c 


ataaaagaac 


gaaaaggaag 


gaacttcagt 


780 


coacatattt 


tcattgactc 


cttagtacaa 


gggaacctta 


atgaccaaca 


gatcctagaa 


840 


gaca^'ta'tga 


t autttct Ct 


ggccagttgc 


ataat aactg 


caaaattgiig 


tacctigggca 


900 


atctgttt-tt 


taaccacctc 


tgaagaagtt 


caaaaaaaat 


tatatgaaga 


gataaaccaa 


960 


gtttttggaa 


atggtcctgt 


tac-tccagag 


aaaattgagc 


agctcagata 


ftgtcagcat 


1020 


gtgctttgtg 


aaactg-ttcg 


aactgccaaa 


ctgactccag 


tttctgccca 


gc-ttcaagat 


1080 


at-kgaaggaa 


aaattgaccg 


atttat-tatt 


cctagagaga 


ccctcgtcct 


ttatgccctt 


1 140 


ggtg tggtac 


'k'kcagga't cc 


taatac't't gg 




acaag'ttz'tga 


tccagat egg 


1200 


tttgatgatg 


aattagtaat 


gaaaac-fct-tt 


tcG-tcacttg 


gat-tctcagg 


cacacaggag 


1260 


tgtccagagt 


tgaggtttgc 


atatatggtg 


accacagtac 


ttc-ttagtgt 


attggtgaag 


1320 


agactgcacc 


tactttctgt 


ggagggacag 


gtta-ttgaaa 


caaagtatga 


actggtaaca 


1380 


tca-tcaaggg 


aagaagcttg 


gatcactgtc 


tcaaagaga-t 


attaaaattt 


tatacattta 


1440 


aaatcattgt 


taaattgatt 


gaggaaaaca 


accatttaaa 


aaaaatctat 


gttgaatcct 


1500 


tttataaacc 


agta-tcactt 


tgtaatataa 


acacctattt 


gtact-taatt 


ttgtaaattt 


1560 


ggatttttat 


atatcatatt 


ttcttaattc 


attgtacaca 


tttgacttac 


tgcacagtat 


1620 


attgatcatt 


■t-taatgggaa 


aaaaaaaa 
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What is claimed is: 

1. An isolated and purified polynucleotide encoding a 
polypeptide 

(a) comprising SEQ ID NO:l; or 

(b) consisting of a fragment of SEQ ID NO:l from about 40 
amino acid residue Ala at position 25 to about amino 
acid residue Glu at position 39 of SEQ ID NO:l. 

2. An isolated and purified polynucleotide having at least 
85% polynucleotide sequence identity to the polynucleotide 

of claim 1(a) over the full length of the sequence. 45 

3. An isolated and purified polynucleotide having a 
sequence which is complementary to the polynucleotide of 
claim 1. 

4. An isolated and purified polynucleotide 

(a) comprising the polynucleotide sequence of SEQ ID 
NO:2; or 

(b) consisting of a fragment of SEQ ID NO:2 from about 
nucleotide 109 to about nucleotide 153 of SEQ ID 
NO:2. 

5. An isolated and purified polynucleotide having at least 
85% polynucleotide sequence identity to the polynucleotide 
of claim 4(a) over the full length of the sequence. 



6. An isolated and purified polynucleotide having a 
sequence which is complementary to the polynucleotide of 
claim 4. 

7. An expression vector comprising the polynucleotide of 
claim 1. 

8. A host cell comprising the expression vector of claim 

7. 

9. A method for producing a polypeptide, the method 
comprising the steps of: 

(a) culturing the host cell of claim 8 under conditions 
suitable for the expression of the polypeptide; and 

(b) recovering the polypeptide from the host cell culture. 

10. A method for detecting a polynucleotide, the method 
comprising the steps of: 

(a) hybridizing the polynucleotide of claim 3 to at least 
one nucleic acid in a sample, thereby forming a hybrid- 
ization complex; and 

(b) detecting the hybridization complex, wherein the 
presence of the hybridization complex correlates with 
the presence of the polynucleotide in the sample. 

11. The method of claim 10 further comprising amplifying 
the polynucleotide prior to hybridization. 

m m m 4t * 
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[57] ABSTRACT 

Disclosed is a new gene whkib codes for a NADPH cyto- 
chrome P450 oxidcxeductase from filamentous fungi. Also 
disclosed are a recombinant DNA molecule which com- 
prises at least a part of that gene, an RNA molecule derived 
tiierefEom and a new polypeptide (or protein), as well as at 
least one host cell transformed with such recombinant DNA 
mdecule. Farticulaxly suitable host cells are those originat- 
ing from a filamentous fungus. WUh the polypeptides 
(proteins) and the host cells according to the invention, new 
processes for enzymatic conversions can be performed. In 
particular, enzymatic conversions by means of 
m<Hioaxygenases, more particularly by means of enzymes 
from the cytochrome P450 soperfianiily are improved with 
the inventicMi. 
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TGATAACTCC TCAGCAAATC GGAGTAAACA GAAGGACAAG TCATTGGAGT ACTAAGTAGC 60 

TCCGTGTCAG AGACCCGGAC AGGATCAGCT TCTCCGAACC CGAGACTCCG GGCGAAAAGG 120 

CCACCATCGC TCAGGCTACC ACCTGTGTTC CTTCCGTCGA TCGTCCTCCC TCGTTTCCGG 180 

CTCACGGCCC CCCAAATTAT TGCGGTCTGC TTAGCAGTGG GTTCGGCCTC TCTGTTCTTC 240 

CTGGATCACA CCACGGCTTA CTTTCTTATC CTTTTCCTTT TCCTTTCTTC CTTTCTTCCT 300 

GTTCTCCTTT CTTCCmCC ACCCCCTTCT TTCTTTTAAC CCCATAGCGT CATTCTTTCT 360 

TCCGTTTTAT CTTTGGTTTT GGGACGCCGC CACCTTATCT CGGTTCCTGC CTCGGTCTCC 420 

GGTGATCGCA CCTGGATAGG CTAAGCGTAG GGAGGTGTGA CATTCTTCTT TCACCTCCTC 480 

TCCTTTTCCC GCCTCACTCC GTTCAATCCC CCGCTCCACC CTTTCAGACT CGCCATCGTA 54 0 

TCAAGTCGGG GCCTTTGCTT GCGCCGCTGA ACAGCCTCAC CATGGCGCAA CTCGATACCC 600 

TCGATCTGGT GGTCCTGGCG GTGCTTTTGG TGGGTAGCGT GGCCTACTTC ACCAAGGGCA 660 

CCTACTGGGC AGTTGCAAAG ACCCGTATGC CTCTACCGGC CCCGCGGATG AACGGCGCCG 720 

rTAAGGCTGG CAAGACTCGG AACATCATTG AGAAGATGGA AGAAACGGGC AAGAATTGTG 780 

TTATTTTCTA CGGATCGCAA ACTGGAACCG CTGAGGACTA CGCCTCCAGA TTGGCCAAGG 84 0 

AAGGATCTCA GCGCTTCGGC CTCAAGACCA TGGTGGCTGA CCTCGAGGAA TACGACTATG 900 

AGAACCTGGA CCAATTCCCG GAGGACAAGG TTGCGmTT CGTGCTCGCC ACCTACGGAG 960 

AGGGTGAGCC TACGGATAAT GCTGTTGAGT TCTACCAGTT CTTCACCGGT GACGACGTTG 1020 

CTTTTGAGAG CGCGTCCGCG GACGAGAAGC CTCTOTCCAA GCTGAAGTAT GTTGCTTTCG 1080 

GTCTGGGTAA CAACACTTAT GAGCACTACA ACGCCATGGT TCGTCAAGTC GATGCTGCTT 1140 

TCCAGAAGCT CX^GGCCGCAG CGTATTGGTT CTGCTGGCGA GGGTGATGAC GGTGCCGGTA 1200 

CAATGGAAGA AGACTTCTTG GCCTGGAAGG AGCCCATGTG GGCAGCACTG TCGGAGTCGA 1260 

TiGGATCTCGA AGAGCGTGAA GCGGTCTACG AACCTGTTTT CTGCGTCACC GAAAACGAGT 1320 

CCCTGAGCCC TGAGGACGAG ACGGTCTATC TTGGAGAGCC CACCCAGAGC CACCTTCAGG 13 80 

GTACTCCCAA AGGCCCGTAC TCTCCGCACA ACCCCTTTAT CGCCCCTATT GCCGAATCTC 1440 

GTGAGCTTTT CACCGTCAAG GATCGCAACT GTCTGCACAT GGAAATTAGC ATCGCTGGAA 1500 

GTAACTTGTC CTACCAGACT GGTGACCACA TCGCIX J 'AT TC GCCCACAAAC GCT GG T G CCG 1560 

AAGTGGATCG GTTCCTTCAG GTCTTCGGTC TCGAGGGCAA GCGTGATTCG GTCATCAACA 1620 

TCAAGGGTAT CGATGTTACG GCCAAGGTCC CAATCCCGAC CCCGACCACG TACGATGCCG 1680 

CTGTTCGGTA CTATATGGAA GTCTGCGCCC CTGTGTCCCG TCAGTTTGTA GCCACTCTGG 174 0 

CCGCGTTCGC TCCGATGAGG AAAGCAAGGC AGAGATTGTG CGTCTGGGTA GCACAAGGAC 1800 

TATTTCCACG AGAAGGTCAC CAACCAATGC TTCAACATGC CCAGGCTCTT CAGAGCATCA 1860 

CGTCCAAGCC TTTCTCTGCT GTTCCGTTCT CTCTGCTTAT TGAAGGCATT ACGAAGCTGC 1920 

AGCCTCGCTA CTACTCGATC TCTTCGTCCT CCCTTGTCCA GAAGGACAAG ATCAGCATCA 1980 

C3GCCXSTTGT GGAATCTGTT CXTTCTGCCCG GTGCCTCTCA CATGGTGAAG GGTGTGACTA 2O4 0 

Fiai 
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Thr Ala Lvs Vai Pro He Pro Thr Pro Thr Thr Tyr Asp Ala Ala Val 
355 360 365 

Mec Glu val Cys Ala Pro Val Ser Arg Gin Phe Val Ala 
370 375 380 

Thr Leu Ala Ala Phe Ala Pro Met Arg Lys Ala Arg Gin Arg Leu Cys 
385 390 395 400 

Val Trp val Ala Gin Gly Leu Phe Pro Arg Glu Gly His Gin Pro Met 
405 410 415 

Leu Gin His Ala Gin Ala Leu Gin Ser He Thr Ser Lys Pro Phe Ser 
420 425 430 

Ala Val Pro Phe Ser Leu Leu He Glu Gly He Thr Lys Leu Gin Pro 
435 440 445 

ArQ Tvr Tyr Ser He Ser Ser Ser Ser Leu Val Gin Lys Asp Lys He 
^ 450 455 460 

Ser He Thr Ala Val Val Glu Ser Val Arg Leu Pro Gly Ala Ser His 
465 470 475 480 

Met Val Lvs Gly Val Thr Thr Asn T^/r Leu Leu Ala Leu Lys Gin Lys 
485 490 495 

Gin Asn Glv Arg Ser Leu Ser Arg Pro Ser Arg Leu Asp Leu Leu His 
* 500 505 510 

His Gly Pro Arg Asn Lys Tyr Asp Gly He His Val Pro Val His Val 
515 520 525 

Ara His Ser Asn Phe Lys Leu Pro Ser Asp Pro Ser Arg Pro He He 
^ 530 535 540 

Met Val Gly Pro Glv Thr Gly Val Ala Pro Phe Arg Gly Phe He Gin 
545 ' 550 555 560 

Glu Arg Ala Ala Leu Ala Ala Lys Gly Glu Lys Val Gly Pro Thr Val 
565 570 575 

Leu Phe Phe Gly Cys Arg Lys Ser Asp Glu Asp Phe Leu Tyr Lys Asp 
580 585 590 

Glu Trp Lys Thr Tyr Gin Asp Gin Leu Gly Asp Asn Leu Lys He He 
595 600 605 

Thr Ala Phe Ser Arg Glu Gly Pro Gin Lys Val Tyr Val Gin His Arg 
610 615 620 

Leu Ara Glu His Ser Glu Leu Val Ser Asp Leu Leu Lys Gin Lys Ala 
625 630 €35 640 

Thr Phe Tyr Val Cys Gly Asp Ala Ala Asn Met Ala Arg Glu Val Asn 
645 650 655 

Leu Val Leu Glv Gin He He Ala Ala Gin Arg Gly Leu Pro Ala Glu 
660 665 670 

Lvs Glv Glu Glu Met Val Lys His Met Arg Arg Arg Gly Arg Tyr Gin 
^ 675 680 685 

Glu Asp Val Trp Ser 
690 
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OXIDOREDUCXASE FROM FILAMENTOUS 

FUNGI, DNA CODING THEREFOR AND 
CELLS TRANSFORMED WITH SAID DNA 

BACKGROUND OF THE INVENTION 

The inventioii relates to a new gene, a recombinant DNA 
molecule ccHxqMising at least a part of that gene, an RNA 
molecule derived therefrom and a new polypeptide (or 
protein), as well as a host cell transfanned at least with such. 
lecombinant DNA molecule, in paiticular a transfosmed 
filamentous fungus. In addition, the invention relates to 
processes few the use of the new ptotcin or the new host cell 
in enzymatic conversions using monooxygenases, more 
specifically en^rmes from the cytochrome P450 stq>erfam- 
ily. 

In particular, the invention telates to ih& isolatioo, chai- 
acterizatioD and use of a gene coding for an NADPH 
cytochrome P450 oxidoreductase from filamentous fungi 
Further, this sppbcsdon discloses the use of this gene for 
increasing cytochrome P450-mediated enzymatic activities. 

In nature, monooxygenase reactions are responsible for a 
laige number of conversions of both endogenous and exog- 
enous conqxHmds. The monooxygenases can be divided into 
many different classes of enzymes. One of those classes is 
tfie family of cytochrome P450 mcmooxygenases. These axe 
enzymes occurring in both ptokaryotes and eukaryotes. 

The spatial structure of these proteins has been highly 
conserved during evolution. Hie most diaracteristic part <^ 
each cytochrome P450 is the presence of a haem group 
which is oovalently bound to the protein by a sulfur-iron 
bond. The sulfur always originates from a cysteine mol- 
ecule. The iron atcm is located in the middle of a pocj^yrin 
ring with ^v^cfa the atom has four bonds. The sixth ligand of 
the iron atom is involved in the catalysis of tiie reactiCHi. An 
oxygen is bound to it during the process. Accosdingiy^ the 
manner in isriiich the hacm gtoi^ is bound to the protein is 
characteristic of this dass of proteins and provides fco- a 
specific absorption peak at i^jproximatety P450 nanometer 
(CO reduced f oim). 

Cytochrome P450 proteins are only functional in an 
enzynae complex together with one cr two other proteins 
which talcE care of the transfer of electrons from NAP(F)H 
to dke active center of the c^toc^rome P450. 

Rou£^, the class of cytochrome P4S0 enzyme systems 
can be divided in two subclasses, viz. the eukaryotic 
microsomal P450s and the dass of ^^okaryotic-like'' 
P450s. The general characteristics of the eukaryotic 
microsOTial P450s are that they are bound to the membranes 
of eukaryotcs, that they are two component enzyme systems 
(reaction specific cytochrome P450 and general NADFH 
cytodxrome P4S0 oxidmductase) and that the reaction is 
NADFH-dq>endent The general diaracteristic of die other 
subclass is that the enzyme oon^lexes consist of three 
compCMients. This subclass can be further subdivided in two 
groiq>s, viz. the bacterial P450s (soluble, mostly NADH 
dependent^ conq)lex consisting of a reaction specific cyto- 
chrome P4S0 and a general Fe-S protein and NADH 
reductase) and the eukaryotic P450s occurring in organelles 
such as mitochondria (membrane-bound, mostly NADFH 
dependent, complex consisting of reaction specific cyto- 
chrome P4S0 and a geiKxal Fe-S protein and NAD(P>H 
reductase). 

The cytociirome P450 proteins of the first subclass 
accordingly conq>lex to the membrane with the protein 
NADFH cytochrome P450 oxidoreductase. This protein, 
like cytodirome P450s, is found in the microsomal 
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(endc^lasmatic reticulum) membrane ctf lower eufcaryotes, 
plants and animals. 

Less is known about the structure of this protein than 
about that of cytochrome P450s, but here, too, a high degree 

^ of structural conservation must have occurred during the 
evolution^ in view of the fact that functional exchange of 
NADPH cytodirome P450 OExidoreductase has been dem- 
onstrated between yeast and mammal systems. 

In microsomal enzyme systems of various eukaryotes, it 
has been demonstrated in different cases that particularly 
after induction of a ^>ecific cytochrome P450, the oxi- 
doreductase could be present in a minor proportion. Thus, 
the amount of reductase would be detenninative or the rate 
of conversion of compounds to be modified by cytochronw 
P450. Increasing the amount of NADPH cytochrome P450 
oxidoreductase in the cell could then increase the biocata- 
lytic activity (in respect of reactions catalyzed by P450s). 

To date, little is known about cytochrome P450 enzyme 
systems in filamentous fungL Only a few cytochiome P450 

^ enzynoes have been diaracterized and rather little knowledge 
is availat^e about &e way of dectron transfer firom NADPH 
to the cytodirome P450. It has been demonstrated that with 
fungi, too, the transfer of electrons occurs most probably 
through a NADPH cytochrome P450 oxidoreductase. In this 

^ connection, use was made of inomunological and biochemi- 
cal esq^eriments (Scala, Matthews, Crosta & Van Etten (1988) 
Experimental Mycology 12, 377-385). Tbcy show in this 
artide that NADPH cytodirome P450 oxidoreductases of 
other filamentous fungi can oonq>lement the endogenous 

^ NADPH cytochrome P4S0 oxidoreductase in reconstitution 
experiments with pisatin demethylase from Nectria haema- 
tococca (CYP 57) using an in yitro pisatin demethylase 
assay. 

35 These authors further show an immunological relation- 
ship t>etween fiie NADPH cytochrome P450 oxidcreduc- 
tascs of filamcDtous ascomyoetes tested by them, a telatioa- 
sh^ ^i^ch is absmt with other NADFH cytochrome P450 
oxidoreductases, such as those of the yeast Saccharomyces 

4Q cerevisiae. 

To date, a limited number of genes coding for cytochrome 
P450 enzymes firom filamentous fungi have been doned. 
Ftnrther, some biocfaemical/biotransfcsmation knowledge 
about a few cytochrome P450 ^izymes is available. The 

45 genes widcb have been doned to date code for a lanostetol 
14a-dcmethylase fr<Hn PmdUium Uaiicum (CYP51), ben- 
zoic add para-hydroxylase fromA^rgiilus niger (CYP53X 
a cydoheximide indndbie cytochrome P450 firom Neun^ 
spoTU cmssa (CYP54), a nitrate/nitdte indudble cyto- 

50 dnome P450 firom Fasariian oxysparum (CYP55) and 
pisatin demetfayUse from Nectria haemasococca (CYPS7). 
In particular with regard to the first (CYP51) and the last 
(CYP57) enzyme system, a fair amount of tnochemical 
knowledge is available. The CYF51 enzyme is the point of 

S5 ^ipiication for vay many fungiddes and has ther^ore been 
studied firom the point of view of use for years. In addition, 
biodiemical knowledge has been patilished with regard to 
hydroxylation of biphenyl-like conqxxmds by Aq>exgillus 
species. 

60 Heterologous ca^ression of a fungal cytochrome P450 in 
a non-rdated fungal host has been dcsotibed for CYP57. It 
has t>een found that the mere transfer of the gene coding for 
pisatin demethylase fxcm M haematococca to Aspergillus 
nidiilans leads to the new host acquiring the ability to 

65 demdhylate pisatin. This indicates that all other oonq^nents 
required for the formation of a functional enzyme oon^lex 
in AspergiUus nidukms were already present and that they 
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are also capable of fonmng in vivo an active enzyme 
con^lex with a heterologous cytochrome P450. 

iDcieasing the production of a specific cytochrome P450 
enzyme (for instance throug}) the introduction of additional 
gene copies coding for this enzyme) definitely docs not 
always ensure an increase of Ifae total activity of this specific 
enzymatic reaction. In fact, the introduction of several P450 
enzymes to increase the enzymatic activity is suitable only 
to a certain extent. This will ^ipear firom a publication (Van 
Gcccom et aL (1990) MoL Gen. Genet 223,192-197) 
describing die (doning and characterization of the gene 
coding for the cytocfarome P450 enzyme benzoic acid para- 
hydroxylase firom Aspergillus niger. This publication also 
describes an atten^ to in^irove the benzoic acid para- 
hydroxyiase activity of A, niger. This was attempted by 
increasing the producticMa of the cytochrome P450 in ques- 
tion. However, die increase of the producticMi of the benzoic 
add para-hydroxylase did not have a positive effect on the 
benzoic acid para-hydroxylase activity. Appar^tly, another 
component of the enzyme complex was limiting. It could be 
that the P4S0 mzyme is not supplied with sufiScient elec- 
trons which are necessary to enable the conversions to t>e 
carried out. 

As yet, little is known with regard to the manner in which 
these cytoairome P4S0 enzymes are supplied widi their 
dectroos. The dcMior in the microsomal systems described 
here is NADPH, but it has not yet been demonstrated in 
detail how the transport proceeds. It has been demonstrated 
biochemically that, as with other eoJcaryotic microsomal 
systems, this is probably done by one enzyme: the NADPH 
cytochrome P4S0 oxidoreductase. It is proposed that the 
oxidozeductase could be that Ifmiting factor. 

Acondingly, aldiough a number ot P450 enzynoes for 
pCTt»™i*r filamentous fungi have been described, the cor- 
responding oxidoreductases from those specific fungi were 
not available up to the time of the present invention. 

SUMMARY OF THE INVENTION 

Accocding to the inventicHi, the enzymatic activity of 
P450 enzyme systems in filamentous fimgl can be jn^voved 
by means of a new gene coding for a new ooddoccductase, 
by means of new oxidoreductase and/or by means of the new 
hereinafter described in detaiL 

Whenever in the present specification 'polypeptide* or 
*protcin* is mentioned^ these terms are understood to refer 
inter aUa to a pdypqiitide ^^ddi is at least partly coded for 
by a DNA sequence according to the invention. Bxan4>les 
include fragments and/or derivatives of the new oxidoreduc- 
tase ntliicfa exhibit the desired activity. 

The only thing that could possibly be used up to die time 
of the invention to improve cytochrGoae P450 reactions in 
fimgi are cytocfarcxne P450 oxidoreductases from odier, 
Don^dated oc]ganisms such as yeasts, i^ants, t"*^*^^ and 
mammals. These src the oiganisms whose oofiesponding 
genes arc available. In the endeavor to use bonoologous 
systems which are adapted to the host as best as possible, the 
use of a cytochrome P450 oxidoreductase stemming firom 
fiingi is naturally to be preferred. 

On the basis of the known P4S0 cytochrome oxidoreduc- 
tases from other organisms, it should be pos&iMe with 
standard techniques to p&di: up, isolate and done the oxi- 
dozeductase from filamentous fiingi- 

Suipfisingty, the conent tedmiques, such as nudeic add 
hybridizations and standard PCR (Polymerase Qiain 
Reaction), failed in the present case. As will be described in 
the exaiiq>les widi the piesoit invention, it has proved to be 
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impossible to find a gene coding for the oxid<»:eductase of 
filamentous fimgi with probes (single-stranded DNA-pieces 
\v1uch are comi^cmentaiy to conserved sequences in other 
oxidoreductases) which have been prepared in the conven- 

s tional manner. 

In addition, it has been found that the enzyme normally 
used for PCK (Taq-DNA-pdymerase) is also unsuitat^e for 
piddng up the gene that codes f<^ an oxidoreductase of a 
filamentous fungus using the primers selected by us, which 

10 are based on conserved domains in other NADPH cyto- 
dnome P450 oxidoreductases. 

Only through the choice of a very highly degenerated 
probe in combination with an unconventional polymerase 
have we succeeded in obtaining the gene according to the 
invention. 

The new gene conoprises a sequ^ce as shown in FIG. 1 
(SEQ. ID NO:l). The amino add sequence derived diere- 
firom is shown in FIG. 2 (SEQ ID NO:2). 

20 For different kinds of uses it is interesting to increase the 
triocatalytic activity of a fungal celL The enzymatic activity 
level naturally present is often not hig^ enough for nMxe 
industrial uses of these enzyme systems, for cxanqile the 
ddoxification of chemicals and toxins in waste, soil, water 

25 and air (environmental uses), the modification and ddoxi- 
fication of (raw ™ti^flift for the production of) foodstuffs 
and cattle feeds and the use d[ fungi, or enzyme ooxxq>lexes 
isolated therefrom, for carrying out one or more enzymatic 
conversions for the purpose of the syndiesis of a chemical or 

3Q i^iarmaoeutical intermediate or end prochict. It is also cGOr 
ccivable that a particular product can be produced entirely 
through fermentation of a fungus. If this involves one or 
more cytochrcMne P450 enzymes, the production can be 
imptoved by a strain wifii an increased N ADFH c3rtochrome 

33 P4S0 oxidoreductase activity. 

Cytodurome P450 enzyme activities are often the 
vdodty-limiting steps in a pathway. The increase of this 
type of enzyme activities win therefore have a positive effect 
on the extent and rate of conversion of conqmunds synthc- 

40 sized or degraded by such pathway. Hxanq)le5 of conver- 
sions (partfy) catalyzed by cytodoome P450 are the para- 
hydroxylation of benzoic add and other aromatics, various 
hydroxylations of biphenyl-Hlae conqnunds, the specific 
hydroxylation, qxnidation and demettiylation of aromatic 

45 and non-ffiomatic oo0qx)unds with a TwmitjTing structure 
(soch as PAKs and steroid-like nKileculesX die terminal 
hydroxylation of (short aikd loiig chain) alkanes and fatty 
adds, die dcn^th^atioa of phytoalcxins, etc., etc. At prescat 
it is assunftcd that many cytodirosne P450 enzymes are yet 

so to be discovered. However, in the case of these as yet 
unknown enzyme activities, too, the invention described 
here can contribute to the increase of the activities in 
question. 

The results of the invention disclosed here are not limited 
53 exdusivdy to the organism with wludi ttie investigaticHi has 
bera cairied out, Le. Aspergillus. Two im portant reasons for 
this can be mentioned. Firstiy, the ocmservation among 
hfxndogous genes of filamentous fungi is mostly such that 
genes isolated firom Aspergillus can serve as a tod for 
60 (singly) obtaining the corresponding genes from other filar 
mentous fimgL In view of the conservation in the family of 
cytodiroane P4S0 oxidoreductases, it is expected that this 
win definitely qn'ly to dds gene too. Secondly, the cprA 
gene (the gene coding for the oxidoreductases according to 
6S the invention) of Aipersi^iig^ can also be used direcdy 
in other filammtous fungi as a supplier of additional 
NAI^^ cytodirotne P4S0 oxidOEreductase activity. It has 
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been found in the i>ast years that the isolation of genes from 
fungi through heterologous hybridization with the corre- 
sponding gene from yeasts as probes is quite often unsuc- 
cessful In these cases the use of a conesponding gene 
stemming firom a filamentous fungus often did |nx>ve suc- 
cessfiiL Even the isolation of genes through PGR experi- 
ments using as primers oligonucleotides designed starting 
from (possibly) conserved regions in yeast genes did not 
prove by any means to be always a useful approach. 

it has fuiti:ker been found that genes from filamentous 
fungi can be emiBcnfly used for carrying out corresponding 
tasks in other fungi The promoters are normally functional 
and introns, too, unHke in the case of yeast, are correctly 
spliced by other fungL Exchange of protons in fiingt foaor 
ing part of a ooore complex system, which also holds, for 
instance, fOT cytochrome P450 raizyme systems, has already 
been found to be a real possibility several times. Both a 
subunit of tubulins and a nudeo-coded subunit of a mito- 
chondrial ATPase cQCDplex have been found to be exchange- 
able between different kinds of filamentous fungi in vivo. In 
vitro, the exchangeability of NADPH cytochrome P4S0 
ooddoreductase between different filamentous fungi has 
already been demonstrated. 

^th regard to the acdvi^ of the NADPH cytochrome 
P4S0 oxidoreductase, it is conceivable that modifications on 
the protein (provided in dtrected or non-directed manner) 
can lead to an NADPH cytochrome P450 oxidoreductase 
with better properties in general or for a specific use. This 
kind of aiyustments to the protein (via adjustments in the 
gene) have become possible through the piesent invention 
and therefore fall within the scope ai protection. 

K is also known, in respect of odier organisms, that fusion 
cf a specific cytochrome P450 with the NADPH cytodnx>me 
P450 ooddoreductase, to be effected through a fusion of the 
genes in question, can lead to enzymatically active mol- 
ecules. In a number of cases, this nuy even have a positive 
effect on the total enzymatic activity of the cytochrome P450 
in question. Such a fusion is also understood to foil within 
the scope of the present invention. 

It is highly doubtfiil Aether it is possible to eliminate the 
NADPH cytocfaroinc P450 oxidoreductase activity in the 
cell. This enzyme is involved in a number of activities i^^ch 
are essential to the cell, so that a mutation (directed or 
landomly provided) in the genome which leads to die 
ellminatioa of the NADPH cytochrome P450 oxidoreduc- 
tase acthrity is probably lethaL However, it is quite plausible 
that a change of the expressicHi signals of the gene that codes 
for NADPH cytochrome V450 oxid<xeductase (cprA) can 
also lead to die desired effect (a <^iange (mostly an increase) 
in the NADra cytochrome P4S0 oxidored uct ase activity). 
The data presently available indicate that the promoter of the 
Aspergillus niger cprA gene is not patticiilaily strong. On the 
basis of the knowledge beoconing available through this 
inventi<», the strategy wbaxA, will lead to an increase in the 
NADPH cytochrome P450 oxidoreductase activity through 
modification of the promoter and/or the 5* imtranslated part 
of the <^A gene is sinq>le to ^>ecify. It can be effected 
through modification (directed or random) of its own 
sequences involved in die initiation of mRNA and protein 
synthesis of the cprA gene, ft can also be effected through 
replacement of the promoter and/or a part of the region of 
the qprA gcae coding for the 5 untranslated part of the 
mRNA by other, preferably more efficient, sequences 
(synthetic or stemming from a different gene). 

A totally different use of the gene is the use of this gene, 
or parts thereof such as the promoter, as a rqwiter for the 
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detection of inducing agents zoals toxins, xenobiotics and 
the like. Because of the general cdiaracter of this enzyme it 
is quite conceivable that in this manner a 'wide-spectrum' 
diagnostic can be developed. In this connection one could 

5 for instance consider fusion of the expression signals of the 
gene to a simply detectable reporter gene (coding, for 
instance, for p-galactosidase, ^gilucuronidase or luciferase), 
or detection methods based on immunological techniques. 

Although the in vitro use of cytochrome P450 enzyme 
systems is not yet employed on a large scale, this invention 
provides the possibility of producing one of the oonq>onents 
of such an in vitro enzyme system (the NADPH cytochrome 
P450 oxidoreductase) on a latge scale. This invention also 
contemf^ates die making of fiingal strains which produce 

1^ both die cytochrome P450 in question and the NADPH 
cytochrome P4S0 oxidoreductase in increased amounts, so 
tiiat the enzyme conq)lex can be isolated therefrom in 
simfder manner and in larger amounts. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows the DNA sequence of SEQ ID NO:l. 
FIG. 2 shows the amino add sequence of SEQ ID NO:2. 
^ FIG. 3 shows a representation of the picked-up positive 
Xr<lone, X19-1 and the subclones ccmstructed therefrom, 
pCPR2 and pCPR7. (S=SaII, B=EccRi, Bg^BglH, K=KpnI, 
N=NcoI) 

FIG. 4 shows a r^iresentation of the insert of pCFR2, on 
30 which the CPR gene is located. The regioas whose sequence 
has been detemoiDed are indicated by arrows. (Bg=BgIII, 
fcKpnl, N=NcoI, XF=XhoI, P=PstI) 

FIG. 5 is a rcfsresentation of the 14 dm gene fixm 
Peni ciUiu m italicunu 

FIG. 6 is a gn^ showing fungicide sensitivity as set forth 
inTible VL 

DETAILED E^SCRIFTEON OF THE 
INVHNmON 

40 

Experimental 

Desdibed herein is tlie dcMiing and characterization of a 
gene \^cfa codes for a NADPH cytochrome P450 oxi- 
doreductase or a filamentous fungus. 

45 This involves in particular the cprA gene of AspergilUts 
niger ATCC 1015. Initially, an attenqit was made to isolate 
the gene tiiat codes for NADPH cytochrome P450 oxi- 
doxeductase from A^ergiUus niger throu^ h^erologous 
hybixdizatioa. The corresponding genes of the yeasts Sac- 

50 charomyces eerevisiae and CUzndida tropicalis were used as 
probes. These experiments did not lead to positive results 
(see Bxanq>le I). 

Then an atteixq>t was made to isolate the gene through 
PCR. On the basis of a cotiq>arison of known proteiii 

55 sequoices d NADPH cytochrome P450 oxidoreductases 
from other organisms, primers were designed which code for 
amino acid sequences from the most conserved regions. 
These primers were used in PCR experiments as described 
in Bxanq^e VL This PCR gave various bands, but not at the 

60 expected heights. Subsequently, after a gel dectroj^ioresis 
of the PCR mixture, the region of tiie expected size was 
excised from gel and the (non-visible) DNA was isolated 
therefrom. Then a renewed PCR reaction was performed on 
this DNA with the same primers. This PCR did yield a 

65 fffoduct of the e3q>ected size. This PGR product was isolated 
and cloned into pUC19. However, DNA sequence analysis 
of die isolated done yielded a DNA sequence which did not 



show any similarity whatcvcx to a known NADPH cyto- 
cfarotne P450 oxidcceductase sequence. 

When this attcn^t inroved negative too, an attend was 
TWflH^ to isolate the gene by means of a probe synttiesized in 
an alternative way (Exan^le UT). A PCR reaction was 
pdfonned using Aspergillus niger genomic DNA as team- 
plate and degenerated primers derived finom conserved 
regions from the NADPH cytochrome P450 oxidoreductase. 
In this experiment, in contrast with the conventional enzyme 
(Taq DNA polymerase), now a different tfamnoresistant 
DNA polymerase was used, viz. the so-called Tth DNA 
polymerase. This enzyme comes from the bacterium Ther- 
mus ihemophylus and is generally recommended for use in 
PCR experiments with RN A as template. By means of this 
enzyme, too, many different bands were fcund, amcmg 
others a band of the expected size. This band was isolated 
and cloned in pUC19. Sequence analysis ot a single done, 
pCPRl, gave a clearly recognizable similarity (at psotein 
level) to the NADPH cytochrome P450 oxidoreductase 
sequences known at that time. Tliis clone was used as a 
probe for the pinpose of the isolation of the NADra 
cytochrome P450 oxidCTeductase gene Aspergillus niger 
from a gene Ufaraiy. 

The Aspergillus niger cprA gene was isolated on a 7 kfo 
EcoRI-Sall fragment (pCTRT) and a 3.7 kb BgO^Kpnl 
fragmem (Bxaiiq>le IV). The DNA sequence of the complete 
3.7 ]d> RglTT-Kpnl fragmoU was determined. It is shown in 
FIG. I (SEQ ID NO: IX ^ this DNA sequence an open 
reading frame was found, coding for 692 amino adds. This 
open leading frame is intexiuptcd once by a DNA sequence 
coding for an introo (see Bxan^le V). The start of the 
mRNA was also determined (JExxnapic VI). 

A conq>arison of tiie deduced amino acid sequence with 
the known amino add sequences of NADPH cytochrome 
P450 ooudoredactases from other organisms indicated that 
the idcaticality with the NADPH cytocfaromc P450 oxi- 
dcreductase most closely related on the basis <^ amino add 
sequences (that of the yeast Sact:haromyces cerevisiae) is 
only 40%. In various "conserved" regions of the AspergUius 
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EXAMH-EI 

Cloning of the cprA gene of Aspergillus niger by means of 
hetearologous hybridization 

Insofar as they are not described in detail^ techniques were 
canied oat as described (Sambrook, Fritsch & Maniatis 
(1989) Mdeculair cloning. Cold SfKing Harbor Laboratory 
Press, U.S.A.). 

An attend was made to isolate the gene coding for 
cytochrome P450 reductase of Aspergillus niger AUCC 1015 
by means of heterologous hyMdization experiments. To that 
end, chromosomal DNA (5 ^g) of Asper^Uus niger and of 
Saccharomyces cerevisiae was digested with Hindm and 
with EcoRL This DNA was separated by means of electro- 
phoresis on a 0.8% TBE-agarose gd and transferred to a 
Hyt>ond N membrane (Aroersham). In this way several 
identical blots were made. After fixation of the DNA on the 
membrane try tHiking for 2 hours at 80"* C and 2rA hours of 
prehyfaridization, the blot was hybridized with a labeled 
(Amersham multiprime DNA labding kit, in accordance 
with tbc manu&ctnrcr's directions) cytochrome P450 reduc- 
tase specific probes coming from Candida tropicalis and 
Saccharomyces cerevisiae, icq)ectivdy. As a p^>be for the 
Candida trofHcaUs CFR goie, a 820 bp BooRI fragment 
containing a part of die cprl and a 3.7 kb Spd fragment 
conqjdsing the entire CPR gene were isolated from plasmid 
pTSl (Sutter ct aL, 1990, J. Biol. (3icm. 265 (27), 
16428-16436). As a Sac<^romyces cerevisiae CFR specific 
probe, a 700 bp BamHI fragment containing an internal part 
<rf the CFR and a 3.3. kb PvuII fiagment coo^rising the 
entire CFR gene were isolated from plasmid pTS20 (Sutter 
A Leper, 1989, Bioch. Biof^s. Res. Comm. 160(3), 
1257-1266). TTie fragments were s^>arated by dectrophore- 
sis on a 0.8% TBB-agarose gd and purified by means of the 
Geneclean kit (BiolOl) in accordance with the siqjplicr^s 
directions. The blots were hybridized and washed at 56"" C 
as described (Sambroc^ Pritsdi & Maniatis (1989) Molecu- 
lar d<Miing, Cold Spring Harbor Laboratory Press, U.S.A.). 
The last walking stq;> was performed at 56^ C. with 6xSSC, 
0.1% SDS. ^Wlh toe & cerevisiae probe, dear, specific 
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rajofr NADPH cytodnome P450 oxidoreductase, too, the 40 hybridizUig bands could be seen in the lanes wim & c*wi- 
ideaticality is not 100%. jioe chronwsomd DNA. Eaqjerimcnts with the C. tropicalis 

The expression of the AspergUius niger cpr A gene on an probes were exdosivety performed on cteMnosomal DNA 
mlWA Icvd was also analjwL The ca^Bcsdoo of the gene of Aspergillus niger. Surprising^, with neither probe 
is found to be not v€ayhig|i,wliidi provides possiMities for hybridiztng bands were foimd in the lanes wife Aygyjgitttf 
the imptovemcm of the eaipression by in^woving the cprA 45 nigier dirotnosamal DNA, possibly caused by too liintod a 



promoter or by replacing the cpr A promoter by a different, 
stronger promoter. 

The effect of die introduction of several oofrics of the 
A^ergUlus rtiger cprA gene on file extent of NADPH 
cytochrome P450 oxidoreductase activity was studied, as 
was the effect thereof on the cytochrome P450 activity. 

Exanqile XI describes that die inlrodncdon of several 
copies of the cpcA gene into Aspergillus niger can lead to a 
strongly increased NADPH cytochrome P450 oxidoreduc- 
tase activity. The efifect <^this on the benzoic acid paraby- 
drooty lase activity of fiiis strain (wild-type) is also presented 
in rti^fi cxanqile. 

Exan^Je XH describes that the introduction of several 
copies of die cpiA gene into an Aspergillus niger transf or- 
mant which already contains several copies of the benzoic 
add parahydroxylase gene (bphA) (and, as a resuU, has an 
increased |Hxxluction of the benzoic add para-hydraxylase) 
f^n lead to a very sidsstantial increase of the BPH activity. 
These experiments deady show that the NADra cyto- 
chrome P4S0 Qxidoreductase gene described in this pfUent 
^jpUcation, when introduced into a fungus, can have a very 
positive effect on cytochrome P450 activities. 
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homology between fiic Aspergillus niger cytochrome P450 
reductase gene on Ibe <Hie hand and the S. cerevisiae and C. 
trojBcalis cytochrome P450 reductase genes on the other. 

EXAMFUin 

Qoning of the cgrA gene of Aspergillus niger by means of 
PCR with Taq DNA polymerase 

In view of the negative results in the heterologoos hyfarid- 
izatioa experiments, it was decided to itftempt to synthesize 
a cpr specific fragment by means of Ihc PCR technique. 
Degenerated pdmers were resigned, based on conserved 
sequences in known cytochrome P450 oxidorediictase genes 
crfrat (Porter & Kaspcr (1985) Ptoc Nafl. Acad- ScL U.Sj^. 
82, 973-977); Candida tropicalis (Sutter et aL (1990) J. 
BioL Cbcm. 265, 16428-16436); and Saccharomyces cer- 
evisiae (Sutter & ILoper (1989) Bioch. Biophys. Res. Conun. 
160, 1257-1266) (J!atie I). At the ends, additional sequences 
were provided, coding for specific restiicdon sites (5* PCR 
pdmers were provided with an EcoRI restriction site, 3* 
primers with a BamHI restriction size) to facilitate the 
doning of PCR products and to increase the specificity of 
the procedure. 
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TABLE I 





EcoRI 


MBL997 


5'- CCG GAA TIC CACO/A) ACN GGN ACN GCN GA<a^A) OA - 








dtgaMontcd 1024 tixnes* FE4K btadizig site 




EcoRI 


MBL998 


3' - CXX3 GAA TTC GGN GAN CCN ACN GACOT) AACCVT) GC - 








de^BDcrated 1024 times^FMN bincfic^ site 




BamHI 


MBL999 


5* - OOC OGA TCC GGN CXJf A fOTJN MOnfA) (Q/T/A) AlN 








deseiKntod4608 times, NADFH biucfiiv ate 




BesnHI 


MBLIOOO 


y - GC3G GOA TCC T(OG^ CC/H TON AC(Q/A) TAN ACCCfT) 




Tr-3'(SEQn>NO:6) 




degeniiled 236 tnnes, NADFH binding site 




BanHI 


MBLlOOl 


5' - CGC GQA TCC GGN OC(aT/A) ATC ATfOayA) AlN AC - 




3'<SBQIDNO:7) 




dftgBOCxaied 144 ftiw>ft> NADPH tnndipg nie 



Taibic I, sequences of tiie primers used for Ifae isolation of a 
cpt specific probe. N stands for G/AA7C. 

For the PCR leactiom, T&q DNA polymerase (Peridn 
Elmer) was used. As template, Aspergillus niger chromo- 
somal DNA was used. By way of check, PCR reactions were 
peifonned using as ten^date 10 ng of theplasmidpTS20 (on 
wtdch is located the entire cytochrome P450 reductase gene 
cf Saccharomyces cerevisiae) and chromosomal DNA of 
SacchoFumyces cerevisiae. 

Template and primers were denatured for 10 rnimiti>« at 
94* C, foflowed by 25 cydi (1 minute 94** C 1 minute 43'' 
C 2 minutes 72° CX After 25 cydi, incut»ation took {Hace 
for 5 minutes at 72** C 

When 1^ DNA polymerase with plasmid pTS20 as 
tempi afe was used, fragments of the expected size were 
found when primer combinations MBL997-MBL999, 
MBL997-MBL1000 and MBL997-MBL1001 were used. 
The use of primer MBL998 did not in any of the cases lead 
to synthesis of a product of the expected size. 

The use of 5L cerevisiae chromosomal DNA as tenq>late 
led to a dearly demonstrable product of the eiqiected size 
when the primer combination MBL997-MBL1001 was 
used. The use of the combination of MBL997 with the 
primer MBL999, degenerated 4608 times, led to a wealdy 
demonstrable product of the expected size. In all of these 
cases a large amount of a specific product cf smaller sizes 
was found. ^Ih the primers MBL998 and MBLIOOO, using 
S. cerevisiae dxronoosomal DNA as toi^late, no cGcrect 
PCR product could be demonstrated. 

When AspergUlus niger diromosomai DNA was used as 
tcmptatc^ and using Ikq DNA polymerase^ no product of the 
expected size was found with any of the primer combina- 
tions used. What could be demonstrated in all possible 
combinations was a large amount of a specific, too small, 
product 

The total material of a single experiment with the primer 
combinations MBL997-999, MBL997-1000 and 



MBL997-1001 was sqxaraled through electrophoresis on a 
0.8% TBE-agarose gd. Apiece of gel at the level cf a marker 
band of 1.2 kfo (the expected size) was excised. At this 

^ location in the gd no DNA was visitsieL DNA from tliis gel 
fragment was isolated by means of the freeze squeeze 
method. The so obtained material was used as template for 
a new PCR reaction. Surprisingly, after this Teacticn a 
product at the expected size was found M/h&n using the 

^ primer combinations MBL998/999 and MBL998/1001. 
However, with aU of the primer combinations used 
(MBL997-MBL999,1000a001 and MBL998-MBL999, 
1000,1001) mainly smaller, a specific^ products were found. 
The product of the expected size that was found was 
separated through gd dectrofAoresis cmi a 0.8% Low Mdt- 
ing Point agarose gd ultereaiter DNA was isolated by 
means of p-agarase (New England Biolabs, in accordance 
wifii the directions of the siqipUer). The DNA obtained was 
digested with EcoRI and BamHI and doned into the EcoRI 

^ and BamHI sites of pUC19. After tnmsformation to £lscA«n- 
chia coU^ miniscreen DNA was isolated from 12 ampidHin 
resistant colonies. After digestion of the obtained miniscreen 
DNA with EooRI and BamHI, only one constract was found 
to contain an insert of the cozrect size. Of this a large 
plasmid isolation was done, using a Qiagen T^500 in 
accordance with the supi^ier's instructions. The DNA 
sequence of this construct was analyzed by dideoxy 
sequence analysis (Pharmacia T7 sequencing kit, according 
to the supplier's instructions) using the M13 universal and 

^ reverse primes In the DNA sequ^icc as detennined, no 
homology with known cytochrome P450 reductase genes of 
other organisms was found. 

EXAMPLBin 
Ooning of the cprA gene of A^ergilius ftiger by means of 
65 PCR with Tth DNA polymerase 

As an alternative, it was decided to perform PCR experi- 
ments using, instead of 1 unitTaq DNApdymcrase, 0.1 unit 
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Tth DNA polymerase (Spaero Q), an enzyme whic^ is 
particulaily suiubLe for RT-PCR (reverse transcriptase PGR) 
experiments. Using plasmid pTS20 as ten^late, this resulted 
in products of the expected size when using primer combi- 
nations MBL997-999, MBL997-1000 and MBL997-1001. 5 
When Aspergillus niger chromosomal DNA was used as 
ten4>late, this resulted in a laigc amount of products when 
using primer combination MBL997-1001. The majority of 
these products were found to be smaller than the expected 
size. Surprisingly, when using these degenerated primos in to 
combination with Tth DNA Polymerase, in addition to die a 
q>ecific products, a product of ^e eiqpccicd size was found. 
Hic PGR products wore separated by means of gel electro- 
phoresis on a 1% TBE-low melting point agarose gel, 
whereafter the presumably coirect product was isolated by is 
means of ^-agarase. 

After digesti<»i of the obtained DNA with EcoRI and 
BamHI,the firagment was cloned into the EccRI and BamHI 
restriction sites of plasmid pUC19. After transf<Hmation to 
E. coUy miniscreen DNA was isolated from 12 ampicillin 20 
resistant colonies. After digestion of the miniscreen DNA 
with EcoRI and BamHI, 4 of the 12 pr^)arations woe f mmd 
to contain a plasmid with an insert of the coirect size. 

Of these four preparat ions, of one constract (pCFRl) a 
large DNA Isolation was done by means of a Qiagcn TlpSOO. 25 
Of this DNA, the sequence of a part of the insert was 
determined by means of the M13 universal and reversed 
pcimen This sequence was found to contain r^ions which, 
at an amino acid level, were clearly homologous to other 
cytochrome P450 reductase genes. so 

EXAMPLE IV 
Screening of AspergiUus niger X-gene library with the cpr 
specific probe 

Genomic DNA of the Ajpergillus niger strain ATCC 1015 
was partially digested with Sau3AI and separated by gel 
dectrophovesis. DNA of between 13 and 17 Idb in size was 
isolated from ^e gel and the DNA was isolated. The isolated 
DNA was doned into vector AJSMBL3, digested with 
BamHL This genomic library was screened with the cpr ^ 
specific probe isolated firom plasmid pCFRl (the 1.2 kfo 
EcoRI-BamHI fragment, see FKj. 4). The isolated EcoRI- 
BamHI restriction fcagmeat, witti the cpr q>ecific VCR 
fragment, was radioacttvely labeled with ^^-dCIP using 
the mult^irinke DNA labeling Idt. (Amcrsham) in accor- ^ 
dance with the supplicr*s instractioas. Labeling reactions 
were pqfumi ed for 3 hours at room tcnqicrature. The probe 
was purified by spin column filtration over a 1 ml Sephadex 
G50-medium cohmm. About 40,000 pfu were screened 
(about 20 times the genome). In the first scxeening 13 ^ 
positive fdaqucs were isolated. These were screened again. 
After the second screening, 3 positive fdaques remained 
(X^3, XU-l and X19-1X Unoog^ Soutfaem analysis it was 
demonstrated that all clones contained fragments which 
hybridized with ^e cpr specific probe. The restriction pat- 
terns of these denes were dearly related but not identicaL 
A restriction map <^ d<Mie K19-1 was made (see FIG. 3). A 
7 kb EcoRI-San firagment and a 3 .7 kb Bgin-Kpnl fi^gnoent 
of cloiieXl9-l, which contain the coding iegi<Mi of the cptA 
gene, were isolated and d<med into vector pBluesGr^>t-SK 
IT (Stratagene) digested, respectively, with EcoRI and Sail 
and with BamHI and KpnL This resulted in the vectors 
pCPR7 and pCFR2 (see FIGS. 3 and 4). 

EXAMPLE V 

Sequence analysis of the AspergiUus niger cpr A gene 65 

The sequence of botli strands of the Aspergillus niger 
Bg^-Kpnl fragment doned into plasmid pPCPR2 was 



determined according to the ^dideoxy chain terminating'^ 
method (Sanger et aL (1977) Proc. Natl. Acad. ScL U.S.A. 
74, 5463-5467) using the ^'S-dATPT? DNA sequencing kit 
(Pharmacia) in accordance with the supplier's instructions. 
Sequence analysis of subdones was performed using the 
M13 universal and reverse pimcrs. Parts of the sequence 
were determined using synthetic oligonudeoddes, using 
pCPR2 ds-DNA as template (see FIG. 4). Sequence infor- 
mation was analyzed with GCG software (Devereux et al., 
(1984) Nucleic Adds Res 12, 387-395). 

In the DNA sequence a region possibly coding for an 
intron was found (FIG. 2, position 2367 . . . 2437). The 
actual absence of this intron in the mRNA of the cpr A gene 
was demonstrated with KT-PCR experiments. In them the 
expected difference in size of the PGR product was found 
when using primers located around the intron, with chro- 
mosomal DNA or isolated total RNA as template (GeneAn^ 
RNA-PCR kit, Peridn Elmer). 

The deduced amino add sequence of the Asperg^lus niger 
cprA grae was found to show a 39-46% identicality to the 
amino add sequences of the <^ the yeasts Saccharo- 

myces cerevisiae, Candida tropicaUs and Schizosacchan>- 
myces pombe and the geaes froax rodents and an ^jproxi- 
mate 35% identicality to the CPR amino add sequences of 
plants published to date. 

EXAMPLE VI 
Determination of transcx^ition start 

For determining the transcrq)tion start of the Aspergillus 
niger cytochrome P450 reductase gene, a primer extension 
expoiment was performed. To that end, two primers, FE50 
and PEIOO, were ^'^gft*^ These primers were based on 
sequences located on, respectively, 50 bp and 100 bp 3' of 
the ATG and read in the direction of the ATG. 
PE50 

5' CXZA-CGC-TAC-CCA-C 3 (SBQ ID NO:8) 
PEIOO 

5' GAG-GCA-TAC-GGG-TC 3* (SEQ ID NO:9) 

RNA was isolated from Aspergillus niger tcansf ormant 
T16 (van Gocccm et aL, (1990) Mol Gen Genet 223, 
192-197). Mycelium was cultured and ground in liquid 
nitrog^L The fine powder was resuspended in 1 ml RNAzol 
(Cinna^iotecx) whereafter the isolation was continued in 
accordance with the siq)plier*s instnictiotts. 

Of bdh primers 100 ng was kinated with 5 \d ^-yATP. 
Reactions were perfamued for 30 minutes at 37*^ C. as 
described (Sambrook, RcitsGfa & Maniods (1989) Molecular 
dotting. Cold Spring Haibor Uboratory Press, U.S.A.>. 
After the incubation the primers were purified by filtration 
over a Scphadex Q50 spin ccdumn. Added to this primer was 
NH4 acetate to a final concoitration of 2M and 2.5 volume 
ice-cold EdianoL Primers were precipitated at —20** C for 60 
minutes. Mmcrs were peUetized by centrifiigation at 4** C, 
14000 rpm. for 30 minutes, whereafter the pellets were 
washed with 70% etfaanol and were resuspeaded in 20 pi TE. 

Primers were ex-ended with M-MLV reverse transc^itase 
for 1 hour at 37** C. The reaction mix (25 pi) contained 10 
|ig RNA, SO mM NaCl, 34 niM Tris-HCI^H 83, 6 mM 
MgCLx, 5 mM DTT T, 20 0 U M-MLV reverse tran scrip tase 
(Gibco), 0.5 mM dCTTP, 0.5 mM dATP, 0.5 daM dCFP, 0.5 
mM dTTP and 5 \d CZ5 ngi) of the Idnatcd primer. 

The reactions were stopped by the addition of 2 pi 0.5M 
EDTA, followed by an alcohol precq)itation (60 mimites 
-20^ C). After centrifiigation (30 minutes, 14000 rpm, 4*" 
C) the pellet was resuspended in 5 pi distilled water. To the 
mix was added 3.4 |il stop mix (Pharmacia T7 polymerase 
sequencing kit). Before bdng ^^lied to gel, the samples 
were dmaturcd by incubation at 95'' C. for 5 noinutes. 
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whereafter tiiey were directly traced on ice. One half oi the 
sanq>le was applied to a 8% wedge shape deaatuiing poly- 
acrylamid gel. 

By way of check, sequence reactions were performed with 
both primers (Pharmacia T7 polymerase sequencing kit) 
with plasmid pCPR2 as template and applied to gel simul- 
taneously with the primer extension mix. After 2 hours of 
electrophoresis (1100 V) the gel was dried and an X-ray 
sensitive film was exposed. 

Strikingly, results with primer PESO were much stronger 
than with pinter PEIOO. In particular in the lanes with. 
primer F^O a great many bands were visible. Combination 
of both results led to the designation of three possible 
transcription start positions, viz. at 86 bp (ctcActc), 55 bp 
(cagActcg) and 37 bp (aagjcgg) bcf rar the ATG. The last 
band was not foimd when pdmer FCIOO was used. 

EXAMPLE Vn 
Transformation of Aspergillus niger strains 

The AspergiUus niger strains N204 (ATCC 1015, csp, 
met, Boschloo et aL, Appl Microbiol Biotechnc^ (1990) 34, 
225-228; contains 1 copy of the bphA gene) and T18 
{Aq?ergiUus niger N271 provided with about 12 copies of 
the bpiiA gene; van Goccom et aL (1990) Mol Gen Genet 
223, 192-197) were cultuied in 250 ml conqilete medium 
(minimal mindiiim siq^lemented with 0.1% Cas amino 
adds^ 0S% Yeast extract, 0.1 mg/ml methionine, 1 ^g/ml 
pyridoxine) in 21 Erlenmeyer flasks. The cultures were 
inoculated with 1.10^ spores per ml and then incubated for 
18 hoars in an air-agitated incubator at 35** C. and 300 rpm. 
After incubatioEU mycelium was harvested by filtration 
through a miradoth filter (Catbiodiem). This mycelium was 
then protoplasted and the protoplasts isolated therefrom 
were transformed as described by Yelton et aL, Free NatL 
Aca. ScL USA 81 (1984) 147(X-1474. 

Samples of 100 pi protoplasts were transformed with, in 
all, 1 Mg circular pAN7-l DNA, 9 |ig circular pCFR2 DNA, 
and 2 pi of a IM ATA (Aureen tri-caiboxyl acid) solution. 
Control tubes COTtained no DNA or <mly 1 ^g pAN7-l (Punt 
et aL, Gene 56 (1987) 117-124). The transformed proto- 
plasts were thai plated on minimal medium agar plates 
siqiplemented with 1.2M sortitoi, 1 pgtel pyndoxine, 0.1 
mg/ml metfaioniDe and 100 pg/ml hygromycin. The plates 
were incubated for 10 days at 35'' C After 4 days the first 
spoanlftting transfocmants became visible. IVansfoonants 
were twice applied wtlh a brush to foim pure cultures on 
mtntiwiT medium agar plates with 100 ^gAnl hygroniycin, 
0.1 mg/ml methionine and 1 pyridoxine. 

Per transformation 10 to 40 hygromycin resistant trans- 
fonnants could be formed. On transfonnatian plates of 
protoplasts which had been treated without DNA, no hygro- 
mycin resistant transfocmants could be found after 14 days 
of incubatioQ at 35*' C. 

EXAMPLE Vm 
Screening cf transfonnants 
T18 mult^le copy qnA transfonnants 

The transfoimants from Exanq)le VH were tested for their 
hygromycin resistance level by plating of spores on mlninml 
medium agar plates supplemented with 500 pg/ml 
hygromycin, 0.1 m^ml methionine and 1 ^g/ml pyridoxine. 

For selecting transfonnants with an increased cprA 
miRNA leveL an RNA colony fayfacidization experiment was 
performed according to a modified version of the protocol as 
was described for SacchoFwnyces cereyisiae by Stepioi and 
Butow (1992, NucLAcids Res. 18(2) p380). Minimal 
medium agar plates were inoculated with spores of 
transfcrmants, covered with a Hyfoond-N filter, and subse- 
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quently inoibated at 25° C. untU mycelium was just visible 
but spore formation had not developed yet The filters were 
transferred to a 5(X) pi <^<'P ^ sorbitol buffer (1.2M sortntoL 
O.IM sodium citratc/pH 5.8, O.IM EDTA, 50 mM 

5 li-mercapto-ethanol), incubated for 5 minutes and subse- 
quently transferred to a sheet of Whatmann 3 MM filtering 
paper. After drying for 5 minutes on the Whatmann paper, 
die filters were transfetred to a Petri dish with a drop (5(X) 
pi) of sorbitol buffer to which 10 mg/ml Novozym 234 
(NOVO Nordisk) had been added. The Prtri dish was closed 

^ airtightly wUh parafilm and the myceliima was protoplasted 
for 1 hour by incubation at 35^ C. After protoplasting, the 
protoplasts were lysed by transfetring the filter to a 5(X) pi 
drop of lysis buffer (2% SDS, 73% formaldehyde, 50 mM 
Tris-HO/pH 7.5, 10 uiM EDTA) fcdlowed by an incubation 
of 5 minutes at room tenqierature. The RNA was blotted on 
the filter by transferring the filter, with lysed protoplasts, to 
a she^ of Whatmann 3 MM filtering papec until the filter 
was dry (about 5 minutes). This blotting step was repeated 
a single time. Finally, the filters were transfenxd for 1 
minute to a drop of 800 6xSSC (Sambrook et al, •♦*), 
0.1% SDS and subsequemly dried by transferring the filter 
to a sheet of Whatmann 3 MM paper. Hie RNA was fixed on 
the Hytvond N filter through UV crosslinking. To that end, 
the filter was set C0 a UV lig^t box for 3 minutes. The filters 
were hybridized overnight at 65** Cwhh a' V<*CrrP labeled 
qirA fnt)be. The filters were washed at 65° C, utilizing 
0.2xSSC:^ 0.1% SDS in the last washing step. After over- 
night exposure at —70** positive transfonnants could be 

3^ identified. 

N204 mult^lc copy cprA transfcrmants 

Hygromycia resistant transformants of N204 were 
screened with an RNA colony hybridization in which a qprA 
specific probe was used. A scccHid screening was performed 
with a cytochrome P450 reductase specific filter activity 
assay. For this pmpose, spores of hygromycin resistant 
transfonnants were inoculated on minimal medium agar 
plates supplemented widi 0.1 m^ml methionine and 1 pgAnl 
pyridoxine. The {dates were covered with a Hybond N filter 

^ and incubated at 25° C. until mycelium was >ist visible but 
virtually no spores had been formed yet. The filters were 
removed, the myoeUnm was lysed by freezing in liquid 
nitrogen and suttsequent defrosting. The filter was incubated 
in the dark for about one hoar in 25 ml neotetrazolimn 
solution (5 mg NeotPtrwzotiimi , 5 mg NADPH per 25 ml). At 
locaticxis with cytochronoe P450 reductase activity, a clearly 
pink preripitatr is formed. 

On the basis cf the two screening mi^ods, transfonnants 
W13 and W35 were selected as presumable multiple copy 

^ CprA transfonnants. 

EXAMPLE IX 
DNA analysis of transfonnants 

Mycelium of transformants selected in Example vm was 
cultured in 50 ml minimal medinm, supfAementcd with 1 

55 iigAnl pyddoxine, 0.1 tDgfral methionine and 0.1% CAS 
amino adds. Erlenmeyer flasks (300 ml) were inoculated 
with 1.10® spores and placed in an air*agitated incubator 
(35** C, 300 ipm). The mycelium was harvested by filtration 
over miradoth filter (Calfaiochem) and washed with 25 ml 

60 0.9% NaCl. The myceUum was immediately firozen in liquid 
nitrogen. From this mycelium, chromosomal DNA was 
isolated according to the method described by Kolar et aL 
(Gene 62 (1988) 127-134). The pellet of the last alcohol 
prec^yitadon was incorporated in 1(X) pi distilled water. 

65 Cfatomosamal DNA (40 |il) was digested with 20 U EcoRI 
and20UKpnIfc»'6hoursat37*> C. After the digestion equal 
amounts of DNA were sepa rat ed on a 0.8% TBE-agarose gd 



5,801,024 



15 



by means of electrophoresis (IS hours, 35 volts). After 
separation the DNA was traDsfcrred to a sheet of Hybond N 
filter (Amersham) in accordance with the instnictiGms of the 
supi^er. After fixatioD of the DNA (2 hours, 80** C.) the blot 
was prehybridized at 65° C. for 4 hours and subsequently 
hybiidized at 65° C. with a ^^p labeled cprA specific probe. 
The blot was washed at 65** C, the last washing st^ 
involving 0.2xSSC, 0.1% SDS. An X-ray sensitive fiJm was 
exposed at -70** C. For 24 hours. 

Seven selected transfcvmants were analyzed in this noai^ 
ner. In one transformant, only the wild-type band was found, 
in four transformants integration of 1—2 copies of the <^A 
gene was found and in two transformants integration of 
several copies of the cpr A was found. Of these multiple copy 
integrants, transfoiroant T18 #5 was selected. 

EXAMFUX 

NADFH: Cytochrome P4S0 (cytocfarome c> reductase activ- 
ity 

CPR activity was detennined by measuring the possibility 
of cell free extracts of transformants to reduce cytochronke 
c (Ad^>fced firom Madyastha et aL (1976) Biocfaemistiy IS, 
1097-1102). 

Mycelium was cultured for 13 hours in 50 ml mininml 
medium, siqiplemented with 0.1% CAS amino adds, 0.1 
mg/ml methicMiixie and 0.1 ^g^ pyiidoadne. The medium 
was inoculated with 1.10^ spascs per ml in 300 ml Erlen- 
meyer flasks in an air-agitated incubator (35° 0, 300 ipm). 
The myceUum was harvested by filtration over miraclodi 
filter (Caibiochem) and washed with 25 ml 0.S>% Nad 
Excess buffer was removed by blotting the fitter with myce- 
lium between tissues. The mycelium was frozen in liquid 
nitrogen and ground in a moctar. The fine powder was 
transferred to an Eppendorf reaction vessel filled with 1 ml 
ice-cold CFR-cadraction buffer (50 mm sodium pbosphate 
buffec^H 7.8, 20% glycerol, 1 mM EDTA, 1 mM DTT, 5 
pg/!ml Lei4>cptin, 4 mM FMSF, 0^% sodium deoocycfaolate). 
The vessels were dtiectty mixed and placed on ice. Tlie 
extract was incubated on ice for 15 minutes (solubilization 
st^) and oentiifuged for 15 minutes at 4** C, 1000 g. The 
supernatant was tFansfccred to a new vessel and preserves an 
ice. Cytochrosne c reductase activity was measured directly. 

To 1 nU CFR-assay mix (03M K-phoscfaate tsnfTcr^H 
7.7, 0.1 mM EDTA, 1 mM KCN) was added 2 |d 20 mM 
cytocfar<»ne C. At room tenq>erature (20°-25* C) 5-20 pi 
oeD fi^ extract was added and inrubatrd for 1—2 minutes. 
Reactions were initiated by addition of 1 |il 100 mM 
NADFH. Reduction cf cytochrome c was msCMiitored spcc- 
txopbotometxically for 3 minutes (550 nm). 

Itetein concentrations were detennined witii the Bio Rad 
protein assay kit in accordance with die instructions of the 
siqiplier, using BSA as standard. 

Table n, Cytochrome C reductase activities (e=21 
noMxm*^) of different strains/transfonnants. 

Values coming from two indcpcadent cultures are speci- 
fied. Percentages are based on the average of the di^lo 
experiments. One unit is the amount of cytochnxne C (mM) 
which is reduced per minute per mg total protein. 



StmD^ 



N2D4 
W13 
W33 



1 

26 



2.14 
2036 
8.12 



100 
960 
379 
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Estimated 
cprAcopy 






trwifufi nfniiTit 


number 


Units 


Percentage 


T18 


1 


2^ 


117 


T18#5 


&10 


54 J7 


Z550 



10 EXAMPLE XI 

BPH activity in vitro assay 

For measuring BPH activity in vitro a not yet optimized 
assay was used, based on the benzoale dependent consunq>- 
tion of NADPH by the NADPH cytochrome P450 oxi- 

15 doreductase. 

The BPH activity was measured in vitro in Aspergiilus 
ni^tfr strains T 18 (1 copy cptA^ 12 copies bphA)^ strain T18 
#S (6 cc^es cprA, 12 copies bphA), strain N204 (1 copy 
cjp^A^ 1 copy bphA) and strain W35 (1 copy bphA, muldple 

20 copy CprA) 

Mycelium was cultured for 18 hours in 250 ml minimal 
medhmi, suppiemoKted widi 0.1% CAS amino adds, 0.1 
mg^ml methioaiine and 0.1 pyridoxine. The medium 

was inoculated with 1.10^ spores per ml in 2 1 Rrienmeyer 

25 flasks in an air-agitated incubator (35^ C, 300 rpm). The 
mycelium was harvested by filtration over miradodi filter 
(CaUriocfaem) and washed widi 250 ml 0.9% NaCl The 
mycelium was transfened to 2 1 Bricnmeyer flasks filled 
with 250 ml induction medium (minimal mrdium widi 

30 pyridoxine and meddonine in ^^lich, instead of ghioose, 
0.1% benzoate was present as C-souice). Mycelium was 
harvested by filtration over miiacdotfa filtration doth. After 
washing with 0.9% NaO, excess buffer was removed by 
blotting the filter with mycelium between tissues. Tlie myce- 

35 lium was firozen in liquid nitrogen and ground in a mortar. 
The fine powder was transferre d to a n ^ppendocf reaction 
vessel filled with 1 ml ice-cold CPR-«xtraction buffer (50 
mM sodium phoqihate buffec4>H 7.8, 20% s^ycerol, 1 mM 
EOTA, 1 niM OTT, 5 Ixi4>q>tm^ 4 mM P 

40 sodium deoxocbolate). Tlie vessels were directly mixed and 
placed on ice. Tlie extract was incubated on ice for 15 
minutes (solubilization) and then centiifiiged for 15 mimites 
at 4^ C.« 3500 xpm. The supernatant was transferred to a new 
vessel and preserved on ice. 

45 BPH activity was n^easured by spectrophotametrically 
motiitccing (340 nm) the BFH specific consumption of 
NADPH by the cytodirome P450 reductase. Oil ftee 
extract (10 Ml) was added to 500 pi BPH assay buffer (100 
mM Tris/pB lA, 10 iiM MgCI^. 200 \M NAKOI). The 

50 non-specific NADPH ctMisumptioa was measured for 2 
minutes (5-BA). Benzoate was added (20 |j1 of a 20 mM 
solution) and the NADPH oonsumpti<» was measured for 4 
minutes (5-BA). BFH specific MADFH consunpticm was 
determined by the following calculati<Mi method: 
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Units (1 unit=l innd NADPH consumed per minute per 
60 mg total protein) were calculated by multiplying 6BA with 
the extinction coefficient (e=6.22.1(r^M~*cm~*). 

Protein concentrations were determined with the Bio Rad 
protein assay kit in accordance with the instructions of the 
supplier, and BSA was used as standard. 
65 T^e IV, in vitro BPH activity. Ctoe unit corxespondswi^ 
tiie benzoate dependent consuixq>tion of NADPH (^M) per 
minute per mg total protein. 
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Estimated 


Esdnuited 






Strain/ 


bphA copy 




Units 




TtaosSorwBsA 


number 


number 


PejceDtage 


N204 


1 


1 


365 


ICX) 


W13 


1 




577 


158 


T18 


12 


1 


899 


246 


T18 #5 


12 




1590 


435 
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EXAMPLE Xn 

Bph activity, in vivo HPLC assay 

My odium was cultured for 18 hours in 500 ml minimal 
medium, siq>plemeated with 0.1% CAS amino acids, 0.1 
mg/m! methionine and 0.1 }tg/mi pyridoxine. The medium 
was inoculated with 1.10^ spores per ml in 2 1 Erlenmeyer 
flasks in an air-agitated incubator (35° C, 300 rpm). Hie 
mycelium was harvested by filtration over miraclotfa filter 
(Calbiocfaem) and washed with 25 ml 0.9% Nad. Mycelium 
was subcultured in induction medium (minimal medium 
suf^emcnted with 0.1 mg^ml methionine and 0.1 (igAnl 
pyddoxine, adding, instead of 1% glucose, 0.1% 

benzoate as C-source). Samfdes of the medium were taken 
after 5 hours. 

Samples of 2 pi were analyzed by means of HPLC 
chromatography an a reversed phase C-18 column 
(Superchem LC IS-DB) at 30" using 10 mM sodium 
citrate buffei/pH 3, 60% meOianol as dution buffer. A flow 
rate of 1 ml/min. was used. Both benzoate and 4-hydroxy- 
benzoate were detected at 245 mn. Reference samples 
contained 10 pi 1 ^ooM benzoate and 10 ^1 1 mM 4-hydroxy- 
benzoate. 

TMc IV, concentrations of benzoate and 4-hydro(xybenzoate 
after incubation of transfonnants for 5 fars in induction 
mftdiiitn. Detection by means of HPLC Aspergillus mger 
can use 4-hydroxy-b«izoatc as C-sourcc and so metabolize 
further As aresult, the sums of the last two ocdumns of the 
amounts of benzoate and ^OH-boizoate are not identical. 



Stxtas/ 


bsfaa copy 
luuubcj 


Bfidmaled 
cpiAcopy 


BctiToate 
(mM) 


4-OH-BeQswlB 
(tcM) 








6.70 


OXK) 


N204 


1 


1 


1.13 


S.20 


W35 


1 


&6 


0.60 


3.63 


T18 


12 


1 


lAO 


330 


T18#5 


12 


&10 


Oj0056 


3.28 
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(Hynes et aL Mol. CelL BioL 3 (1983) 1430-1439), locate 
on a NoU fi:agment s^yproximately 5 kfo in length (the 
original ends of the chromosomal amdS firagment <EcoRI 
and SaU) had l>een r^laced with Noa sites). The ^dS gene 
was used as sdectioa marker in transfmnation^xperiments. 
Piasmid pl4 dm was constructed as follows. The yeast- 
expression plasmid YEP24 was digested with BamHI and 
SalL Between these sites was ligated an ^>proximately 2.1 
kb chromosomal (partial) BamHI-(partial) SaU fragment, on 
which the entire 14 dm gene from PeniciUium italicum was 
located (see FIG. 5). 

Transfcnnation of Aspergillus mger pCFR2-amdS 

A. mger N402 (Bos, PhD Thesis Agncuhural University 
Wageningen NL, 1986) was used as starting strain. Trans- 
formation e:q>eriments were carried out as described in 
(Kelly and Hynes EMBO J. 4 (1985) 475-479). A. niger 
N402 was transfonried wifli pCPR2-amdS. TVansforroants, 
aftear being fl^iplied with a brush to form pure cultures on 
selection plates (minimal nyHinwi plates with 15 mM CsO 
with Acetamide (10 mM) functioning as sole N-source), 
were further selected for their possibility to use acrylamid as 
sole N-source. Strains with a high amdS copy number grow 
better on acrylamid than do strains with a lower copy 
number (Verdoes et aL, Transgenic Research 2 (1993) 
84-92). 

A selected number of txansfocmants were further analyzed 
by means of Southern blot analysis. Eventually, transfor- 
mant AB2-2 (^ 10 copies cprA) was selected for further 
expcrimei^ pl4 dm 

Plasmid pl4 dm was introduced into Aspergillus niger by 
soeans of cotransformation with plasmid pAN7-l on 'w^di 
tiie fai^ gene is located, widch affords resistance to Hygio- 
mycin (Punt et aL Gene 56 (1987) 117-124). Transfcxmants 
were selected on plates with 100 pgML hygromycin. Posi- 
tive transfccmants were thereafter selected for their resis- 
tance to higher conoentratixxis of hygromycin. Transfor- 
mants that grow well at higher concentrations of 
hygromycin too, were further analyzed by means of South- 
em Blot analysis. Eventually transfocmant AB-Dl (^10 
copies 14 dm) was selected for further experiments. 

In transformant AB-Dl, in a second transf<Hination 
experiment, an attempt was made to introdu ce, in addition to 
extra 14 dm copies, extra cpcA copies. Transformant AB-Dl 
was for this purpose transformed with plasmid pCRR2- 
amdS. The further selection procedure was identical to the 
procedure followed for the isolation of transformant AB2-2. 
Eventually transformant ABDl. 15 (^ 10 copies 14 dm, ^ 10 
copies CprA) was selected for further experiments. 
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EXAMPLE Xm 

To verify the tnoad action of the cytochroaxte P450 oxi- 
doreductase gene, e3q>eriment5 were performed in whidi the 
effect of CFR overproduction on the activity of a different 
(in this exanqple even heterologous) cytochrome P450 
enzyme was tested. 

Few that purpose, Aspergillus niger strains were con- 
structed which were provided with several copies of the 
cprA gene of Aspergillus niger together with several copies 
of flie gene coding for lanosterol 14cerdemethylase (14 dm) 
firom the filammtous fungus PemdUium ituUcum. 
ConstmctiQii of plasmids 

For the introduction of the two genes, two plasmids were 
constructed. Plasmid pCPR2-amdS was constructed by 
digesting plasmid pCPSt2 with NotI and providing it with a 
functional copy of the amdS gene of Aspergillus nidulans 
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CvtixJaotDft P450 reductsae 


uutxvity 




Stmnf 


Estimftled 14DM X 


SotimafeDd^ 




Ttansfonnni 


1 copy Diunhftf 


copy fTTim^^^r 


Units 


N402 


"1" 


1 


1J05 


AB2-2 




&10 


14.6 


AB-Dl 


"1" + fcio 


1 


1.65 


AB-D1.15 


•*i" + aio 


£10 


41.7 



Table V. CFR activity in selected transformants, measured as 
NADPH d^ndent cytochrome C reduction and expressed 
in Units (mmd cytochrome c reduced per minute per mg. 
total protein). Estimated 14 DM copy number. "1" wildtype 
A. mger 14 DM gene, extra copies of the P, italicum 14 
DM gene. Lanosterol 14a-demetfaylase activity. 

Lanosterol 14a-da]Mtiiylase activity in the four different 
strains was determined in an experiment wherein the radial 
growth of the different strains in ntedia with different 
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conceatrations of inhibitor of the lanosterol 14a- 
ddDcthylase is measured as a measure for the e&zymatic 
activity. For that purpose^ mycelial plugs were provided on 
{dates with increasing concentrations of known lanosto-ol 
14a'-demethylase inhib£t(vs (DMIs). At a higher oonceotra- 
tion of the DMI, an increasingty smaller outgrowth of the 
mycelium develops. After a few days the length of the 
growaout mycelium filaments was determined. Calculated 
were the BC30 value, the concentration ai the DMI at whi<^ 
the length of ^e grown-out mycelium filaments is one-half 
of the outgrowth in the absence of DMI The DMIs used 
were Fenarimol, Btaconazcde and Imazalil. By way cf check, 
the inhibition of growth by Bcnomyl was also loolced at. 
This product inhibits mycdium growth via a different, 14 
dm independent^ mechanism. The results of these experi- 
ments are shown in Table VI and FIG. 6. 





N402 


AB2-2 


AB-Dl 


AB-D1,15 


Bstimatcd 14DM 


-1- 




-r + iio 


-r + ftio 


copy number 










Bstnmtfid cyA 


1 


&10 


1 


&10 












BoiDioyl 


2jn 


3J05 


1^ 


3.22 




100% 


134% 


85% 


142% 


Pcosrtmol 


1.73 


3^ 


UJO 


58.5S 




100% 


224% 


665% 


3270% 




OAS 


0^ 


i25 


9.10 




100% 


211% 


500% 


2022% 




6^3 


936 


1757 


66.27 




100% 


153% 


273% 


1030% 



T^le VL Lanostoxd 14a-<leiiiethy]ase inhibition (DMI) 
EC50 values expressed in ppm. Bstimatrd 14 DM copy 
number: '^l** wUdtype A mger 14 DM gene, extra copies 
of the Ualicum 14 DM gene. 
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Media and solutions 
50xAspA(l 1.) 

300 g NaNOa 

26gKa 

76 g KOHaPO^ 

18 ml KOH (lOM) 
1000 * Spore elements (100 ml) 

2J2 g 2nS04.7H20 

1.1 g H^BOa 

0.5 g Mnd^AH^O 

0.5 g FeSO^.THaO 

0.17 g Coaa-fiHaO 

0.16 g CuSO^^SHaO 

0.15 g NaaMo0^.2H20 

5.0 g EDTA 
Minimal noedium (500 ml) 

10 ml 50 * Ai^A 
10 ml 50% ^ucose 
0.5 ml 1(X)0 * spore elements 
1.0 ml IM MgSO^ 
Con^lete medium 

10 ml 50 * Aq>A 

10 ml 50% fi^uoose 

0^ ml 1(XK> * spore el^nents 

1.0 ml IM MgSO^ 

5.0 ml 10% Cas-amino acids 

25 ml 10% Yeast extract 



SBCJUBNCE USnMO 

( I )aENBRALINPO(RMUaiGN: 

( i t t )NUMBStOFS8(2UENCZS:9 

< 3 )n4P0RUAaON]CRS8Q]I>MQ; U 

( i )Sag|UBNCeCBARACIEia81IC5: 

( A >LENCrtH:37DllMMpiin 

( C ) SIRANDBDWBSS:; ■■frnw 
( D ) TOPOLOatYi imk mmm 

( i i >MOLBCOLB'nn>B:Z»<A(anoHb) 

( i i i ) HV PO m miCAL; NO 

( i T yMmSBHSBiNO 

< X i )SaQ|UENCSI3GSCSUl>TION:SBQn>NO: 1: 



TOATAACTCC TCAOCAAATC OOAOTAAACA OAAOOACAAO TCATTOOAOT ACTAAOTAOC 60 

TCCOTOTCAO AOACCCOOAC AOOATCAOCT TCTCCOAACC COAOACTCCO OOCQAAAAOG 120 

CCACCATCOC TCAOOCTACC ACCTOTOTTC CTTCCOTCOA TCOTCCTCCC TCOTTTCCOO ISO 

CTCACOOCCC CCCAAATTAT TOCOOTCTOC TTAOCAOTOO OTTCOOCCTC TCTOTTCTTC 2.40 

CTOOATCACA CCACOOCTTA CTTTCTTATC CTTTTCCTTT TCCTTTCTTC CTTTCTTCCT 3O0 

OTTCTCCTTT CTTCCTTTCC ACCCCCTTCT TTCTTTTAAC CCCATAOCOT CATTCTTTCT 360 

TCCOTTTTAT CTTTOOTTTT OOOACOCCOC CACCTTATCT COOTTCCTOC CTCOOTCTCC 420 
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OOTOATCOCA CCTOOATAOO CTAAOCOTAO OOAOOTQTOA CATTCTTCTT TCACCTCCTC 4S0 

TCCTTTTCCC OCCTCACTCC OTTCAATCCC CCOCTCCACC CTTTCAOACT COCCATCOTA 340 

TCAAOTCOOO OCCTTTOCTT OCOCCOCTOA ACAOCCTCAC CATOGCOCAA CTCOATACCC 600 

TCOATCTOOT OOTCCTOOCO OTOCTTTTOO TOOOTAOCOT OGCCTACTTC ACCAAOOOCA 660 

CCTACTOOOC AOTTOCAAAO ACCCOTATOC CTCTACCOOC CCCOCOOATO AACOGCOCCO 720 

CTAAOOCTOO CAAOACTCGO AACATCATTO AOAAOATOOA AGAAACOOOC AAOAATTOTO 7»0 

TTATTTTCTA CGOATCOCAA ACTGOAACCO CTGAOOACTA COCCTCCAOA TTGGCCAAOO »40 

AAGOATCTCA OCOCTTCOOC CTCAAOACCA TOOXaOCTOA CCTCOAGOAA TACGACTATO 900 

AOAACCTOOA CCAATTCCCO OAGOACAAOO TTGCGTTTTT CGTOCTCOCC ACCTACOOAO 960 

AOGOTOAGCC TACOOATAAT GCTGTTGAOT TCTACCAGTT CTTCACCOOT OACGACOTTG 1020 

CTTTTOAOAO COCGTCCGCO OACOAOAAOC CTCTOTCCAA OCTOAAOTAT OTTOCTTTCO 1080 

OTCTOGOTAA CAACACTTAT OAOCACTACA ACOCCATGOT TCOTCAAOTC OATGCTOCTT 1140 

TCCAGAAOCT COOOCCOCAO COTATTOOTT CTOCTGOCOA OGOTOATOAC OGTOCCOGTA 1200 

CAATOGAAGA AOACTTCTTO GCCTGOAAOG AOCCCATOTO OOCAOCACTO TCGGAOTCOA 1260 

TGOATCTCGA AOAGCGTOAA GCOOTCTACO AACCTGTTTT CTOCOTCACC OAAAACOAOT 1320 

CCCTOAOCCC TOAOOACGAO ACGOTCTATC TTOOAGAGCC CACCCAGAOC CACCTTCAOG 1380 

GTACTCCCAA AOGCCCOTAC TCTOCOCACA ACCCCTTTAT CGCCCCTATT OCCGAATCTC 1440 

OTGAGCTTTT CACCOTCAAO GATCOCAACT OTCTOCACAT OOAAATTAGC ATCOCTOGAA 1500 

GTAACTTOTC CTACCAGACT OOTOACCACA TCGCTGTTTO OCCCACAAAC OCTGOTGCCO 1560 

AAOTOGATCO OTTCCTTCAO GTCTTCOGTC TCGAOOOCAA GCOTOATTCO GTCATCAACA 1620 

TCAAGGGTAT COATOTTACO GCCAAOGTCC CAATCCCGAC CCCOACCACO TACOATOCCO 16 80 

CTOTTCGGTA CTATATGOAA OTCTOCOCCC CTOTOTCCCO TCAOTTTOTA OCCACTCTOO 1740 

CCGCOTTCOC TCCGATGAOO AAAGCAAOGC AOAOATTGTG COTCTGOOTA OCACAAOOAC I 800 

TATTTCCACO AOAAOGTCAC CAACCAATGC TTCAACATGC CCAOOCTCTT CAGAOCATCA 1860 

COTCCAAOCC TTTCTCTOCT GTTCCGTTCT CTCTGCTTAT TGAAOOCATT ACGAAOCTGC 1920 

AOCCTCOCTA CTACTCOATC TCTTCOTCCT CCCTTOTCCA GAAOOACAAO ATCAOCATCA 1980 

COOCCGTTOT OGAATCTOTT COTCTGCCCG GTOCCTCTCA CATOOTOAAG OOTOTOACTA 204O 

COAATTATCT CCTCOCOCTC AAGCAOAAOC AGAACGGOCG ATCCCTCTCC COACCCTCAC 2 100 

OGCTTGACTT ACTCCATCAC OOTCCCCOGA ACAAOTACGA COOTATCCAC OTTCCCOTOC 2 160 

ATGTTCOCCA CTCOAACTTC AAGCTGCCCT CTOATCCCTC TCOOCCCATT ATCATOOTTG 2220 

OTCCTGGTAC TOGTOTTOCT CCTTTCCOTG GTTTCATTCA GOAACOTOCT OCTTTOOCOO 2280 

CCAAGGGCOA GAAGOTTOOA CCCACTGTTC TCTTCTTCGO TTOCCOCAAO AGTGACOAOG 2340 

ATTTCTTOTA CAAOGATOAA TGGAAOGTAA OATATCTTTT TTTCTTTTCC OCAGCTACCT 2400 

TCATACATCT CGGATOCTAA CATATCOCGA TTCOCAOACC TATCAOOACC AGCTTOGAOA 2460 

CAACTTOAAG ATCATCACTO COTTCTCOCO TOAOOGTCCT CAOAAOGTCT ACGTTCAGCA 2520 

CAOACTCCGC OAOCACTCCO AACTTOTCAG CGACCTTCTG AAGCAGAAAO CTACCTTCTA 2580 

CGTCTOTGGT GACGCTGCAA ACATOGCTCG COAOOTTAAC CTTOTOCTTG OCCAOATCAT 2640 

TOCTOCOCAO COTOGTCTOC CCGCCGAOAA GOGCOAAGAA ATGGTCAAOC ACATGCOTAG 2700 

ACGTOGACGC TACCAOGAAO ATGTOTOOTC ATAATCTTTC AATOCATCOA CTTTTCTTTC 2760 

TTGTCTATCA CGACOOCCTT CTCOATCCAT TATTTTATTT AACOCCTAOA TOATCTTTGC 2820 
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ATATATACTC COCTOATTTT OCCTATTCAT CTOTTTTOCT TOOCOTOOTT TATOTATOCC 2880 

TAOTTTATTT OTTTTOTOCA CCOACCOOCC AOCCACACAT TOAAOTOOCT TOAOCATOAO 2940 

TOCOOTAOCC AOTOTCOAAA OAACAOOATA OACOATCATO ATTATTGCOO OAACATOTTA 3000 

TOCCATTCTO OOCATATTOA TATCTOOTTO CATOAOCCCA OAOOATACOA AAAOATOAAT 3060 

CCATATTTAA TTTOCACAAT ACTTTTCOCC TTCTTCATCT AOTAATTAAA TTAATTOAOC 3 120 

ACTOACCOAA COAOCTOACA CCTGCTOCTC OOAATAOCCO ACAACOCATT OACOTOCAAO 31«0 

AOATOCATAA TCATTACAAT CAACAAOTAO ACTOOTAACT AAATCACTGA ATACTACAGT 3240 

TACTOCCTAC TTTCAOCCAA AAAGTAATAC TOAAGATTTC OGOGAATCAA ATAOAAOAAA 3300 

CATOCATAAO CCCAACCTCO OCAATACCOO OAGTTAAOCA CAOTAACCAA AACCAAACCA 3360 

AACTAOAACC OGCOCGCGAC CAOTOACCCA TCOTCATTCC CGOTATCAOC AGTTCAGTCA 3420 

OACTOOCTOO CTAGCCCOAA CCCAACTOCC OCAATCATCC ATCCATCCTC AACCCOCCCC 3480 

TCCCATOCCA ACCTCTCTAC TCCGCAOAOC OAGGOACAAA AAAATOAGAT OCAOCAATTA 3540 

ACCACOATAA TCTAOCAAAA AGAAAOTTAG AAGCCOOAAO AACATACATA TCOCTTTTAC 3600 

COCTOTTCGA CTOCOACOAC OOOTCTTOAG AOCAOTTCCO CCACOTOGOC OAAAAOCTOO 3660 

ACTOCACACT ACTTACOCTA CCCTACGCTA CCTCOGTACC C 3701 

( 3 >INP0ftM«riONPOItSBQIDNQ:2: 

( i )SBQ!UENCBCBARACtERISlICS: 
( A )l 
( B )TYPE:i 
( C )S 

( D )TCraJOaT:aik»M» 
( i i )MCffjyT TIB TYPE: iioteia 
( i i i ) HTFOIHBnCAL: NO 
(IT ) ANn^SBNSB: NO 

< X i ) SBQUBNCS IseSCSlPnON: SBQ n> NO: 2: 

Mot Al4 Ola Leu Atp Thr Lev Aap Leu Val Vftl L«o Ala Val Lea Lea 

1 5 10 15 

Val Oly Sar V*l Ala Tyr Phe Thr Lyi Oly Thr Tyr Tip Ala Val Ala 

2 0 2 5 3 O 

Lys Tb r Arg Met Pto Leu Pro Ala Pro Atg Met A»a Oly Ala Ala Ly» 
3 5 4 0 4 5 

Al» Oly Lya Thr Arg Asa lie lie Oln Ly« Met OIn Gin Thr Oly Lys 
5 0 5 5 6 0 

A«a Cy» Val lie Pbe Tyr Oly Set Ola Thr Oly Thr Ala Oln Aip Tyr 
65 70 75 80 

Ala Ser Arg Len Ala Lya Ola Oly Ser Ola Arg Phe Oly Lea Ly» Thr 
8 5 9 0 9 5 

Met Val Ala Asp Leu Oln Olu Tyt Afp Tyr Ola At* Lea Acp Ola Phe 

10 0 10 5 110 

Pro Ola Asp Ly» Val Ala Phe Phe Val Leo Ala Thr Tyr Oly Oln Oly 

115 1 3 O 12 5 

Ola Pro TbT A«p Aia Ala Val Olo Phe Tyr Ola Phe Pbe Thr Oly Asp 
130 135 140 

Asp Val Ala Pbe Oln Ser Ala Ser Ala Aip Olo Ly» Pro Leo Ser Ly* 
145 150 155 160 

Leo Ly» Tyr Val Ala Phe Oly Leo Oly Aaa Asa Tbr Tyx Olo Bis Tyr 



25 



5,801,024 

^continued 



26 



Asa Ala Met 



Ax B 



Me t 
5*S 



V a 1 

1 ft 0 



Arg OIb Val Aip 



A 1 a 
t & 5 



Ala Phe Oln Lys 



O I n Ax g 



O 1 o 
3 1 0 



I 1 e 
i 9 5 



Oly Ser Ala Oly 



Aftp Phe Leo Ala 



Tx p 
2 1 5 



O 1 a 

2 2 5 



Ser Me t Asp Leo 



O 1 D 
3 3 0 



O 1 o 
2 0 0 



Ly . 

Are 



Oly Asp Aap Oty 



Ola Pro Met 



Ola Ala 



V« 1 

2 3 5 



Cy» Vat Thr Olo 



A » n 

2 4 5 



Ola Ser Leo Ser 



Pro 

2 5 0 



T r p 
2 2 0 



Ty r 
A» p 



Leo Oly Oln 



Pro 
3 6 0 



Thr Oltt Ser Hi a 



Leu 
3 6 5 



T y r Ser 



Phe 

3 9 0 



A 1 a 

2 7 5 



Bit A»B Pro Phe 



1 1 e 

2 8 0 



Thr Val Lyt A»p 



A 1 a 

3 O 5 



Oly Sex Aaa Leo 



Sex 
3 10 



Pro Thr A»a Ata 



Leu OI« Oly 



Ly • 

5 4 0 



Thr Ala 



Ly i 

3 5 5 



Ol y 
3 2 5 



Arg 
P ff o 



Ar g 

2 9 5 



T y r 
O 1 n 



Atn Cy« Lev Hi« 



OI« Thr Oly 



A . p 

3 15 



Val A • p 



Arg 

3 3 0 



A • p Sox Val 



I 1 c 
3 4 5 



A • a lie 



lie Pro 



Th r 
3 6 0 



Pro Tbr Thr 



Ar g 



Thr 
3 » J 



Ty r 

3 7 0 



T y t Me t Oln Val 



Cy a 

3 7 3 



Ala Pro Val Ser 



Leu Ala Ala Phe 



A 1 a 

3 » O 



Pro Met Arg Ly» 



A I a 

3 9 5 



Val Trp Val Ala 



Ola 
4 0 5 



Oly Lea Phe Pro 



Ar g 
4 1 O 



Ly • 
Ty r 



Ar g 
3 ft 0 



Arg 
Ol y 



Leo Ola Hi* 



A 1 a 

4 2 O 



Ola Ala L e o Ola 



Ser 
4 3 5 



Ala Val 



Arg 



Ser 
4 6 5 



Ty r 
4 3 0 



I 1 e 



Me t Val 



Ola A » a 



H i » Oly 



Pro 
4 3 5 

Ty r 

Th T 

Ly a 

Ol y 



Pro 
5 15 



Phe Ser Leo Lea 



I 1 D 

4 4 0 



Olo Oly lie Thr 



Sox 
Al a 

O 1 y 



Ar g 

5 0 0 



lie Sex 



Val 

4 7 0 



Ser 
45 5 



Sex Ser Leo Val 



Oln Sex Val Arg 



Lev 
4 7 5 



Va 1 
4 ft 5 



Thr Tbr A»a Tyr 



Leo 
4 9 0 



Sex Leo Sex Ax^ 



Pro 
5 O S 



Arg A a a 



B i a 

5 3 0 



$er A«a Phe 



Val Oly Pro Oly 



Ly a 
Ly • 



Tb r 
5 5 0 



Ty r 



Leo 
5 3 5 



Aap 

5 2 0 



Oly lie His Val 



Pro Ser Aap Pro 



Oly Val Ala Pro 



Olo Arg 
Leu P b e 
Oln Trp 



Ala Ala 



Leo 
3 6 5 



Ala Ala 



Phe 



Ly a 

5 9 3 



Ol y 
5 8 0 



Cy» Arg Ly» 



Tbr Tyr Oln Aap 



Ser 



Ola 
6 0 0 



Ol y 



A>p 

5 8 5 



O I n 
5 7 0 



O 1 a 



Phe 

5 5 5 



Ly » 



Ala 

2 0 3 



Leo O 1 
1 9 0 

Oly T h 



A I a Ala L < 



Oln Pro Va 



Ola Oly Thr Pro 



Thr V a 
2 5 



L y s O 1 
3 7 0 



Ala Pro lie Ata 



Me t 

3 0 0 



O 1 « 
3 8 5 



Olo lie 



Hi* lie Ala Vj 



Phe Leo Ola 



Val P h 

3 3 



lie A * 

3 5 0 



Oly 



A. p 

3 6 5 



Ola Phe Vi 



Olm Arg Le 



Ola P r 
4 1 



lie Tbr Ser Lya 



O 1 a 
4 6 O 



Lya 
4 4 5 



Ly a 
Ol y 



Leo Ala Leo 



Ser Ar« Leo A*p 



Pro 
5 2 5 

Ar g 
Oly 
Val Oly 
Phe L « D 



Sex 
5 4 0 



Ar g 



Leo Oly Aap Aaa 



Leo 
6 0 5 



Pro P h 
4 3 0 



Lip Ly 



Lya O 1 
4 9 

Leo L e 
5 10 

Val Hi 

Pro 11 

Phe XI 



Pro T h 
5 7 

Tyr Ly 
5 9 0 

Lya II 



Phe 
2 4 0 



Tyr 



Trp 
3 2 0 



Oly 



Cy i 
4 0 O 



Hi ft 

4 8 0 



Ly ■ 



O I a 

5 6 0 



Aip 
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... P.. Sc, X.. O.. P.o O.. ... v.. X,. v.. C. H.. A.. 

... C. H.. S., O:. v., S.. A.P ... U,. O.. ..a 

6 2 5 

. , ^ « Aiji Ars Olu V«l A»a 

,. T,. V.l CT. Ol, A.p AI. Al. A.. M. . Al. A.g 

6 4 5 



T h » Phi 



6 5 0 



LCD Val LcT» Oly OlD 
6 6 0 



lie II* Alo Ala Gl 
6 6 5 



> Ar g Oly ^* * 



L y 



. Ol. 01. M.. v.. L,. e». M.. A,, AX, Ar, O,, A,, Tyr 



6 7 5 



Ol* A»p V»l Trp Set 
6 9 0 



( 2 ) 



INPORMAnON PC3R SBQ ID NO: 3: 

( i )SBQfUEWCBCHARAOTRIOTCS: 
< A ) LENOIH: 29 I» 



( B yrm^ 

( C ) STRANDEDNBSS: 
<D)T0FOUXJY: 



( i i ) MOLECULE TWEi «l 
( i i i )HTPOnHBnCALiNO 
(IT ) AHTl-SENSB: NO 

( M. i )S8Qi™CBDBSCRlPnON:OTQn>HO:3 : 
CCOOAATTCC ARACNOONAC NOCNOAROA 



2 )INPDRMAn0NP0RSBQIDN0c4: 

< i )SBQaB*CECHARACIMSIlCS: 
( A )LBNOia 39lM»opiio 
< B >Tyi>B:Mekioaaii 
( c ) SrntANDeDHBSS: 



< i i )M01BCULErOT:o<l 
( i i i ) HtPOIBBnCAU NO 

( i w )ANI^SB^BB:NO 

< X i )SW|lIHNCBDBSCRIFI10N:SBQIDNO:4c 
CCOOAATTCO ONOANCCNAC NOAYAAYOC 



( 3 )INK»MAnONFMLCTQn>NO:5: 

< • )SBQUENCECHARACIER1SIICS: 
< A )LBNaiH:26^Me|)H» 
( B )TYre:i«*Bic«« " 
( C ) SntANESeDNE! 
( D ) TOPOUOOFY: akwiwm 

( i i i ) HYKyiOTHCAL: NO 
( i V )AKIMBNSB:NO 

< X « )SBC2UBNCaDBSCRIPnON:SBQIDNO:5: 
COCOOATCCO ONCCNAYNAD DATNAC 



( 2 )INPaRMArK]ff4PORSeQlDNO:6c 

< i )SBaUBNCBCHARACIERtSnCS: 
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( A >LENCnH:a6tu*«I»tn 
( B ) TYPE: mwkic wad 
( C ) STRA^a3EDMES$: mlcMWii 
( D )T0«>OU0OY: nnkaown 

( i i ) MCM^CULE TYPE: oih<r iMCteic add 

( i i i ) HYPOIHSnCAL: NO 

( i V >AKIVSENSE:MO 

( z i >SeC|UQ«CEDesCIUmON: SBQIDNO:6: 
OCOOOATCCT STTONACRTA NACYTT 26 



( 2 > mFORMAnON POft SBQ ID NO: 7: 

( i >SBQUQ«CECBARACIERI9nCS: 
( A > LENCnH: 26 hmo {Mm 
( B ) TYPBt hkUc acU 
( C ) SmtANQSDNBSS: ookacnm 
( D >TOI>OUX}Y:isiAwM 

( i i ) MnCBTULF TYPE: atba mtekac maa 

( i t i > HYPOEHEnCAL: NO 

{ i T )ANI^^SN5S,NO 

( X i ) SBQIUBNCBOBSCRIFTIOK: SBQ ID NO: 7: 
COCOOATCCO ONCCDATCAT DATNAC 26 



( 2 )INR0ftM^KnaNPOR WmNOtS: 

( i >SB«]ENCECBARACrBltl5nCS: 
( A )LENaiB: 13taMeSMin 
( B ) TYPES md B k- «cid 
( C > SIftANDBDNBSS: mkaowm 
( D ) TOPOUOOY: QateMn 

( i i ) MOT nriTIB TYPE: dbg MKlefcaAi 

( i i i ) OYPGriBBnCAL: NO 

( i ▼ )ANXVSBNSK:NO 

< X I >5SQUQ4CEZ3eSCIlIPnON:S8Qn>NOe& 



( 2 )im)BMArEGNFQ&SSQIDNQ:9: 

( i ) SBQUBNCB CHARACi lttUam CS: 
( A yiOKTIBi 14tMe|Mn 
( B ) Type.' mekic mad 
( C )SnUNDeDeiBSS:iakMnn 
( D )TOPOCXXnfiii^Knn 

( i i >MOLBCUL£ TYPE: QtevcMc add 

(lit > HYPOXHSnCAL: NO 

( i ▼ >AMn-SENSB:NO 

(x i )SBgtJENCeDeSCItXPIION: SBQIDNO:9: 



Wc claim: 

1. A lecombinant DNA molecule compdang one of 
(a) a Dudeic acid sequence encodiiig a cytoduome P450 

oxidoreductase having an amino add sequence as ^ 

shown in SEQ ID NO:2, 



(b> a nadeic add sequence which hybridizes under strin- 
gency condilioD5 of SO"" C. and 6K SSC with the 
nuddc add sequence of (a); 

(c) a nudeic add sequence complementary with the 
nuddc add sequence of (a), and 
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(d) a Ducicic acid sequence complementary with the 
nucleic acid sequence of (b). 

2. An RNA nK>lecule coding for a cytochrome P450 
oxidoreductase, which is derived from a DNA molecule 
accco'ding to claim 1. 

3. A polypeptide having cytochrome P450 oxidoreductasc 
activity, said polypeptide being coded for by a recombinant 
DNA molecule accoErding to claim 1. 

4. A transformed host cell which is at least transformed 
with a DNA molecule according to claim 1 and which is 
derived from a filamentous fungus. 

5. A host cell according to claim 4 which is d^ved from 
a filamentCHis asoomycete. 

6. A host ceU according to claim 5 which is derived from 
the genus Aspergillus. 

7. A host ccU according to claim 6 which is derived from 
Aspergillus Niger. 

8. A host cell according to claim 4 which is further 
transformed with a DNA nwlecule which codes for a p 450 
cytochrome protein. 

9. A host cell according to daim 8, wlierein the P450 
cytochrome protein is coded for by one of 

(a) a DNA molecule as shown in SEQ ID NO:l, and 

(b) a DNA molecule v/iuch hybridizes under moderately 
stiingait conditions with a DNAnKdeciile according to 
SEQ ID NO:l. 

1#. A process for enzymatic conversion of a substrate, 
oon^irislng contacting the substrate with a filamentoos fun- 
gus c3Q>able of converting said substrate and ofstionally 
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recovering the oonversicMi product formed, wherein said 
filamentous fungus is a recombinant filamentous fungus 
transformed with and expressing a recombinant DNA mol- 
ecule according to claim 1. 

3 11. A process for enzymatic conversion of a substrate, 
conqsrising contacting tiie substrate with an enzyme capable 
of converting said substrate and optionally recovering the 
coDversion product formed, wherein said enzyme is a cyto- 
chrome P450 oxidofcductase according to claim 3. 

^° 12. A recoihbinant DNA molecule conquising one of 

(a) a part of nucleic acid sequence SEQ ID NO:l that 
encodes a cytochr(»iie P450 ooddoreductase, 

(b) a nucleic add sequence which hybridizes under strin- 
15 gency conditions of 56** C. and 6X SSC with the 

nucleic add sequence of (a), 

(c) a nucleic add sequoice complementary with the 
nucleic add sequence of (a), and 

(d) a nucleic add sequence conqiementary with the 
^ nucleic add sequence of (b). 

13. A process fox enzymatic conversion of a substrate, 
comprising contacting the substrate with a filamentous fun- 
gus capable of converting said substrate and optionally 
recovering the oonversi(m product formed, wherein said 
^ filamentous fungus is a recombinant filamentous fungus 
transformed with and expressing a recombinant DNA md- 
ecule according to daim 12. 

***** 



